1
Introduction


This book is intended as a guide to digital audio for operational
sound staff. Its aim is to provide a sufficient insight into the
principles to enable current and future digital equipment to be
used more confidently and effectively.


Digital audio is developing fast with a steady flow of new
products, each of which will probably only have a relatively
short lifetime. It is revealing to compare the PCM-F1 digital
recording system (now becoming obsolete after only about nine
years) with 1/4" analogue tape (which clearly has many years of
life left). Fortunately for those trying to understand the
subject, all digital audio systems share much common underlying
theory and have to overcome similar problems.


However, the understanding of digital audio is often made more
difficult by the unnecessary use of jargon words - 'write'
instead of 'record' for example. This book will attempt to
avoid, or where necessary explain, these buzzwords.


Chapters 2 to 4 cover the background theory and Chapter 5 goes
on to deal with digital recording. While the ultimate goal of
an all-digital programme chain is probably many years away,
Chapters 6 & 7 look at two of the important links: digital audio
signal processing and the interconnection of digital systems.
Chapter 8 covers broadcasting systems.


Further reading: 


The following books are suggested for those wanting a deeper
coverage of the subject:


'The Art of Digital Audio' by John Watkinson.
Focal Press.


'Principles of Digital Audio' by Ken C Pohlmann.
Howard Sams.


'Digital Audio & Compact Disc Technology' by the staff of the
Sony Service Centre (Europe).
Heinemann/Newnes.


'The Compact Disc: A Handbook of Theory and Use'
by Ken C Pohlmann.
Oxford University Press.


'Stereo Sound for Television' by Francis Rumsey.
Focal Press.


2
Analogue and Digital


ANALOGUE SIGNALS 


An audio signal has to convey every detail of a sound wave as a
voltage that varies exactly in sympathy with the original air
pressure changes. It is very complex and is known as an
analogue signal - the signal voltage is an analogue of air
pressure. Any imperfection in this analogy will impair the
reproduced sound quality. An analogue signal can have any value
between zero and the defined peak limits of the system. It also
has positive and negative values - just as the original sound
wave had compressions and rarefactions.


There are two characteristic problems of an analogue audio
signal:


1) Any degradation such as added noise or distortion will affect
   the sound quality but will be impossible to distinguish from
   wanted signal (impossible for a piece of equipment, that is -
   unfortunately our ears distinguish all too easily). It is
   therefore not possible to remove the noise or distortion.


2) The effect of these degradations, however small, accumulates
   with each piece of equipment or generation of recording that
   the signal passes through.


When the signal is recorded, there are two further analogues
involved. The variations in signal voltage become variations in
magnetic flux, while time converts to distance along the tape.
If this distance is not a perfect analogue of time the result is
wow and flutter - which also cannot be removed and accumulates
with each generation of recording.


Analogue techniques, particularly in recording, have now been
refined to the point where further improvement is difficult and
expensive. A new approach is needed.


DIGITAL SIGNALS 


A digital audio signal carries information about the sound wave
in the form of a series of measurements - or samples - of the
analogue signal taken at regular intervals and sufficiently
often to describe the signal accurately:


[Figure: samples of the analogue signal; voltage (axis marked at
225mV) against time]


These measurements must be in a form suitable for sending along
wires or recording. This is made easy if the measurements are
expressed in the form of binary numbers. (See Appendix 1 for an
explanation of binary numbers).


In binary counting, there are only two different digits, or bits
(from binary digit): 0 and 1. These can be represented
electrically by two voltages, for example 0V and 5V.


Let's take the example of a measurement of 180mV. In binary
this is 10110100. Represented as an electrical signal it would
be a train of eight pulses, 5V for each 1 and 0V for each 0.
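The conversion can be sketched in a few lines of Python. This is
only an illustration: the 8-bit word length and the 0V and 5V
levels follow the example in the text, while the function name
to_pulses is invented here.

```python
def to_pulses(value, bits=8, low=0.0, high=5.0):
    """Return the binary word and the voltage of each pulse, MSB first."""
    if not 0 <= value < 2 ** bits:
        raise ValueError("value does not fit in the word length")
    word = format(value, "0{}b".format(bits))
    volts = [high if bit == "1" else low for bit in word]
    return word, volts


word, volts = to_pulses(180)
print(word)   # the measurement as a binary word
print(volts)  # one voltage per bit
```

The word is sent most significant bit first in this sketch; a real
system may order the bits differently.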


A signal of this kind is known as a Pulse Code Modulation or PCM
signal. It is a very simple signal, having only two states:
e.g. 0V or 5V. Because there are only two possible states, the
meaning of the signal is not altered in any way by moderate
amounts of distortion or added noise.


All that is required to correctly interpret the signal is to be
able to decide whether it is above or below the half-way
threshold:


[Figure: distorted PCM waveform compared against the half-way
threshold and read as 1 0 1 1 0 1 0 0]


As long as the correct decision can be made, a clean version of
the signal can be regenerated. In this way it is possible for a
digital signal to go through any number of generations of
recording or over long distances without deterioration. There
is a limit to the amount of noise or distortion beyond which it
is no longer possible to correctly interpret the signal.
However, provided that the signal is regenerated before this
limit is reached, the quality of a digital audio signal is
'sealed in' and is independent of the medium used (tape, disc,
line, radio link etc.). So a recording will sound exactly the
same when played from a CD as from a DAT tape - assuming
identical analogue circuitry in the two machines. Similarly,
using a different brand of tape in the DAT machine will not
affect the sound quality in the way it might with an analogue
cassette machine.
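Regeneration can be sketched as a simple threshold decision. The
sketch below is illustrative only: it assumes 0V and 5V levels and
uniformly distributed noise of up to 2V either way, and the
function name regenerate is invented for the example.

```python
import random


def regenerate(received, low=0.0, high=5.0):
    """Decide each bit by comparing the voltage with the half-way threshold."""
    threshold = (low + high) / 2
    return [1 if volts > threshold else 0 for volts in received]


random.seed(1)
bits = [1, 0, 1, 1, 0, 1, 0, 0]
clean = [5.0 if bit else 0.0 for bit in bits]
# Assumed degradation: up to 2V of noise, which stays inside the 2.5V margin.
noisy = [volts + random.uniform(-2.0, 2.0) for volts in clean]
recovered = regenerate(noisy)
print(recovered == bits)
```

Because the noise never carries a pulse across the threshold, the
original bits are recovered exactly. Noise larger than the 2.5V
margin would begin to cause wrong decisions.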


The sound quality of any digital audio system is determined by
the initial measuring process - carried out in the Analogue to
Digital converter (ADC), and by the eventual recreation of the
analogue signal - in the Digital to Analogue converter (DAC).
The potential quality of these processes is fixed at the design
stage by the values chosen for two important parameters - the
sampling rate and the number of quantising levels. These are
the subject of the next chapter.


SUMMARY 


Analogue audio signals are complex and can easily be degraded.
Any degradation is irremovable.


Digital audio signals are simple, having only two possible
states. They can be regenerated.


The quality of a digital audio system is independent of the
medium.


Quality is determined by the conversion processes.


3
Sampling & Quantising


The process of converting an analogue audio signal to digital
form is carried out in two stages: Sampling and Quantising.


SAMPLING 


The first step is to choose the points on the audio waveform
where its voltage will be measured - known as sampling the
signal. We must determine how often we need to sample the
signal - in other words what sampling rate is required. Theory
tells us that the sampling rate must be at least twice the
highest frequency present in the audio signal. (This is often
referred to as the Nyquist Theorem after the Swedish-born
engineer Harry Nyquist (1889-1976), who worked at the Bell
Telephone Laboratories in the USA). So, if we take 20kHz to be
the highest audio frequency of interest, we must sample the
signal at least 40,000 times per second.


At first sight this might seem impossibly crude. At the highest
audio frequency (say 20kHz) there will be only two samples for
every cycle of the signal (a, below) - all information about the
signal between the sampling points will be lost:


[Figure: (a) 20kHz input sampled twice per cycle, (b) square wave
reproduced from the samples, (c) 20kHz sine wave after filtering]


The analogue audio signal reproduced from these two samples is a
square wave (b) - very different from the input. However, a
square wave consists of the fundamental frequency plus all its
odd harmonics (the lowest of these being 60kHz in this case). It
will therefore be possible to filter the signal to remove these
unwanted harmonics and leave a perfect 20kHz sine wave (c).


But what if the original 20kHz input signal was not a sine wave?
The answer is that it must have been, because if the signal had
any other shape it would contain higher frequency harmonics. We
would therefore be breaking the rule of the sampling theory that
the sampling rate must be at least twice the highest frequency
present in the audio signal.


If the sampling rate is too low (i.e. if frequencies higher than
half the sampling rate are present in the audio signal), an
effect known as aliasing occurs. For example, if a 15kHz signal
is sampled at 20kHz, the result is exactly the same set of
samples as if the input signal had been 5kHz:


[Figure: 15kHz input and its 5kHz 'alias'; sampling rate: 20kHz]


This 5kHz signal is the 'alias' of the 15kHz input and will
appear at the output of the system. Note that the alias
frequency is simply the difference between the sampling rate and
the input signal, similar to a 'beat' frequency formed between
two audio tones. For inputs between half the sampling rate and
the sampling rate, the higher the frequency of the input signal,
the lower will be the alias frequency produced.
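The 15kHz example can be checked numerically. The sketch below is
an illustration under stated assumptions: cosine waves are used so
that the two sets of samples match exactly (a sine input gives the
same 5kHz alias but with its polarity inverted), and the function
names are invented for the example.

```python
import math


def sample_cosine(freq_hz, rate_hz, count):
    """Sample a unit-amplitude cosine wave at regular intervals."""
    return [math.cos(2 * math.pi * freq_hz * n / rate_hz) for n in range(count)]


def alias_frequency(freq_hz, rate_hz):
    """Alias of an input lying between half the sampling rate and the rate."""
    return rate_hz - freq_hz


rate = 20000
input_samples = sample_cosine(15000, rate, 8)
alias_samples = sample_cosine(alias_frequency(15000, rate), rate, 8)
identical = all(
    math.isclose(a, b, abs_tol=1e-9) for a, b in zip(input_samples, alias_samples)
)
print(alias_frequency(15000, rate), identical)
```

Once the samples have been taken there is no way of telling which
of the two frequencies was present at the input.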


The important implication of this is that having decided on the
highest audio frequency we wish to use, and thus the appropriate
sampling rate, we must ensure that higher frequencies are not
allowed to enter the Analogue to Digital converter. The input
stage of every digital audio device will therefore contain a
filter, known as the 'anti-alias' filter. To illustrate the
requirements of this filter it is necessary to take the
following alternative view of the sampling process.


The frequency spectrum of a sampled signal


Instead of representing audio signals as graphs of changing
amplitude over time, it is sometimes useful to show the spectrum
of the signal. This is a graph representing the amplitude of
the signal at different frequencies. A pure 10kHz tone shown in
this form would look like:


[Figure: spectrum of a pure 10kHz tone; amplitude against
frequency (kHz)]


while a full range (20Hz - 20kHz) audio signal would be
represented as:


[Figure: spectrum of a full range audio signal; amplitude against
frequency (kHz)]


So what happens when we sample a signal? Take the case of a
5kHz tone being sampled at 40kHz. The spectrum of this signal
after sampling will look like:


[Figure: spectrum after sampling, showing the input tone, the
sampling rate and two 'image' signals; amplitude against
frequency (kHz)]


As well as the input tone and the sampling rate, extra 'image'
signals have appeared at the difference between these two and at
their sum. If the input were a full range audio signal from
20Hz to 20kHz, the spectrum produced would be:


[Figure: spectrum of a sampled full range audio signal, showing
the audio frequencies (input frequency) and the 'image'
frequencies (sampling rate minus audio frequency, and sampling
rate plus audio frequency) either side of the sampling rate;
amplitude against frequency (kHz)]


Note that the audio frequencies and the image frequencies just
meet at 20kHz. If the sampling rate were to be any lower they
would overlap - this is when aliasing occurs:


[Figure: audio and image frequencies overlapping (aliasing) when
the sampling rate is too low; amplitude against frequency (kHz)]


The purpose of the anti-alias filter is to keep these two bands
of frequencies apart by limiting the input audio frequencies to
a maximum of half the sampling rate. If it were possible to
produce a perfect filter with an infinitely steep cutoff (the
mythical 'brick-wall' filter), then we could use a sampling rate
of just twice the highest audio frequency:


[Figure: impossibly steep filter required; amplitude against
frequency (kHz)]


However, since it is not possible to produce a practical filter
with this response, we must use a higher sampling rate to allow
space between the highest audio frequency and half the sampling
rate for the filter to roll off more gently:


[Figure: with an increased sampling rate the filter has 'space'
to roll off to prevent aliasing; amplitude against frequency
(kHz)]


It is important to note that sampling is a theoretically perfect
process - once we have chosen a sufficiently high sampling rate,
allowing for the anti-alias filter, there is no benefit in
increasing it further.


Practical Sampling Rates


The precise choice of sampling rate is complicated by two
further factors: cost and the existing video frame rates.


Cost is important, since the higher the sampling rate, the more
samples will have to be recorded or transmitted. High sampling
rates will therefore lead to increased tape consumption or the
need to rent more capacity in transmission links. Cost is a
primary concern of broadcasters and since the highest audio
frequency transmitted is generally 15kHz, a sampling rate of
32kHz is widely used in broadcasting. Examples include stereo
Sound-in-Syncs and NICAM.


For applications requiring full 20kHz response, any rate
comfortably higher than 40kHz could have been chosen as the
standard were it not for the need to co-exist with video (and
film) frame rates. If digital audio is to be synchronised with
video it is desirable that there should be a whole number of
audio samples per frame. It is not possible to find an
economically low frequency that allows this completely
satisfactorily for all frame rates. However, 48kHz was
eventually chosen as the international standard for professional
use. This gives an even number of samples per frame for:


30 fps    (1600 samples/frame)
25 fps    (1920 samples/frame)
24 fps    (2000 samples/frame)


The only problem is with the American NTSC frame rate of 29.97
which gives 1601.6 samples/frame. This can be solved by
distributing 8008 samples across a block of 5 frames.
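The samples-per-frame figures can be reproduced with exact
fractions. This is a sketch rather than part of any standard: it
assumes that the rounded NTSC figure of 29.97 stands for the exact
rate 30000/1001 frames per second, and the variable names are
invented for the example.

```python
from fractions import Fraction

SAMPLE_RATE = 48000

frame_rates = {
    "30 fps": Fraction(30),
    "25 fps": Fraction(25),
    "24 fps": Fraction(24),
    # Assumption: the exact rate behind the rounded 29.97 figure.
    "NTSC": Fraction(30000, 1001),
}

# Dividing by a Fraction keeps the result exact.
per_frame = {name: SAMPLE_RATE / fps for name, fps in frame_rates.items()}
for name, samples in per_frame.items():
    print(name, float(samples))

# Five NTSC frames together hold a whole number of samples.
ntsc_block = per_frame["NTSC"] * 5
print(ntsc_block)
```

Exact fractions are used because floating point arithmetic on
29.97 would give a figure that is only approximately 1601.6.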


48kHz also has the advantage of a simple 3:2 relationship with
32kHz which eases conversion for broadcast purposes.


Unfortunately, before 48kHz was decided upon, 44.1kHz had
already been chosen for Compact Disc. This rate does give an
even number of samples/frame with 25 fps video but results in a
very awkward number using NTSC. There is therefore a companion
rate of 44.056kHz for use with NTSC video.


So these are now the three sampling rates of practical
importance, with some of their applications:


48 kHz      DAT, DASH, PD, DVTR
44.1 kHz    CD, F1, 1610/30, pre-recorded DAT, DASH, PD
32 kHz      NICAM, stereo SiS


Sampling rate convertors will become a necessary evil in future.


QUANTISING 


The sampled audio signal is not in any way a digital signal - it
is still as vulnerable to degradation as analogue audio. Every
sample can still have an infinite number of possible values. We
must replace this infinite number by a finite number of values,
each identified by a binary number. It is as if we had taken a
measurement of the signal with a pair of dividers - we must now
compare the divider setting against a scale to obtain a
numerical value for the sample.


The example below shows samples compared against a binary scale.
Each numbered division is known as a quantising level and the
distance between each level as a quantising interval:


[Figure: samples compared against a binary scale of quantising
levels, plotted against time]


Each sample is given the value of the nearest quantising level
and any fraction ignored. As a result, each sample will be in
error by up to half a quantising interval, plus or minus. This
error is an unavoidable result of conversion to a digital signal
and is known as quantising error. The smaller we make each
quantising interval, the smaller the quantising error will be.
However, if we make the intervals smaller we will need more of
them to cover the same voltage range. More intervals means
bigger binary numbers and therefore more bits in each number.
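The rounding step can be sketched as follows. The figures are
hypothetical: a quantising interval of 0.125V, positive voltages
only and 3-bit level numbers are assumed purely for illustration,
and the function name quantise is invented for the example.

```python
import math


def quantise(voltage, interval):
    """Return the nearest quantising level and the resulting error."""
    level = math.floor(voltage / interval + 0.5)
    error = voltage - level * interval
    return level, error


interval = 0.125  # hypothetical quantising interval, in volts
results = []
for voltage in [0.30, 0.44, 0.51, 0.85]:
    level, error = quantise(voltage, interval)
    results.append((level, error))
    # Show the sample, its level number in decimal and binary, and the error.
    print(voltage, level, format(level, "03b"), round(error, 4))
```

In every case the error is no more than half the interval, here
0.0625V, whichever side of a level the sample falls.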


A defined number of bits making up a binary number is known as a
word. (A word of eight bits is given the special name of byte -
common in computerspeak). Some different word-lengths and the
maximum number they can count up to are shown below:


[Table: number of bits against maximum count]


By choosing the number of bits used in the system, we can make
the quantising error as small as we need. The previous
illustration used 3-bit numbers, allowing only 8 quantising
levels (including zero). This results in a large quantising
error. A 16-bit system would allow 65536 quantising levels,
giving a very small quantising error.


Notice that we can never totally eliminate the quantising error. 


This means that quantising, in contrast to sampling, is 
inevitably an imperfect process. 


Quantising error can be regarded as an unwanted signal which has
been added to the programme signal. The audible effect of
quantising error depends on the number of bits used in the
system and on the signal level.


With many bits and high level signals the quantising error is
essentially random and is therefore audible as white noise
similar to tape hiss. However, with low level signals or few
bits, the error ceases to be random and becomes related to the
audio signal. It is then heard as a characteristic and
unpleasant 'granular' distortion.


Since the maximum quantising error is fixed at plus or minus
half a quantising interval, it follows that the higher the
signal level, the better will be the signal-to-quantisation-
error ratio. This is comparable to the condition for best
signal-to-noise ratio in analogue systems.


In the worst case of quantising error, where the signal is only
just large enough to cross one quantising level, it will be
converted to a square wave.


The resulting distortion is made worse by the fact that the
square wave contains high frequency harmonics. These will
extend beyond half the sampling frequency and will therefore
produce aliasing, since they have been generated after the
anti-alias filter. The resulting sound is very similar to
'birdies' on poor FM radio reception.


If the signal is so small that it doesn't even cross one step,
the output will disappear completely. This is most likely to be
heard on slowly decaying notes or reverberation.


The number of bits used determines the available dynamic range
of a digital audio system. Each additional bit employed in a
system will allow the number of quantising levels to be doubled
and therefore the size of each quantising interval to be halved.
This will result in half the level of quantising error - an
improvement in the signal-to-quantising-error ratio of 6dB. The
approximate dynamic range of a digital audio system can be found
by multiplying 6dB by the number of bits.
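The 6dB-per-bit rule can be checked with a short calculation. The
sketch below takes the dynamic range to be the ratio of the full
range to one quantising interval, expressed in decibels, which is
an assumption about how the approximate figure is arrived at; the
function names are invented for the example.

```python
import math


def quantising_levels(bits):
    """Number of quantising levels available with a given word length."""
    return 2 ** bits


def dynamic_range_db(bits):
    """Ratio of the full range to one quantising interval, in decibels."""
    return 20 * math.log10(quantising_levels(bits))


for bits in (3, 8, 15, 16, 20):
    # Word length, levels, calculated range and the 6dB-per-bit estimate.
    print(bits, quantising_levels(bits), round(dynamic_range_db(bits), 1), 6 * bits)
```

Each extra bit adds 20 x log10(2), or about 6.02dB, so multiplying
by 6 gives a slightly cautious round figure.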


The number of bits required in a practical system will depend on
whether the dynamic range of the signal has been controlled.
16 bits allows a dynamic range of roughly 96dB and while this
is quite adequate for controlled signals (e.g. desk output), a
microphone input on a digital desk, for example, might well need
20 bits - equivalent to a dynamic range of 120dB.


Note that if a digital audio system is undermodulated by 6dB,
this is equivalent to reducing the number of bits by one. A
16 bit system undermodulated by 6dB is only as good as a fully
modulated 15 bit system.


Dither


It is possible to eliminate the granular effect of quantising
error by the use of dither. This is a small amount of noise -
approximately equal to one quantising interval in amplitude -
which is added to the audio signal before the Analogue to
Digital converter. Its effect is to convert the 'granular'
quantising error into a random noise which is much more
acceptable to the ear. The dither signal causes quantising
levels to be crossed more frequently and results in a high
frequency square wave at the output of the system. The average
value of this signal varies with the audio signal. The ear
performs this averaging operation and hears the original signal
with the dither noise:


[Figure: quantised output without dither and with dither]


Note that the dither does not simply mask the distortion
produced by quantising error - it actually removes it.


The use of dither allows a signal below one quantising interval,
which would otherwise disappear completely, to be heard. The
effect is akin to that of bias in analogue tape recording,
except that unlike bias, the dither signal is itself audible.
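The averaging effect can be demonstrated with a steady signal
smaller than one quantising interval. This sketch makes several
assumptions that are not in the text: the interval is taken as 1,
the dither is uniformly distributed noise spanning one interval
peak to peak, and a simple mean stands in for the averaging
performed by the ear.

```python
import math
import random


def quantise(value):
    """Round to the nearest whole quantising level (interval = 1)."""
    return math.floor(value + 0.5)


def average_output(signal, dither_width, count=20000, seed=0):
    """Mean quantised value of a steady signal with added uniform dither."""
    rng = random.Random(seed)
    total = 0
    for _ in range(count):
        dither = rng.uniform(-dither_width / 2, dither_width / 2)
        total += quantise(signal + dither)
    return total / count


signal = 0.3  # a steady signal of 0.3 of a quantising interval
print(average_output(signal, dither_width=0.0))  # no dither: signal is lost
print(average_output(signal, dither_width=1.0))  # dither: average near 0.3
```

Without dither every sample rounds to level 0. With dither the
output jumps between levels 0 and 1, spending about 30% of the
time at level 1, so the small signal survives in the average.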


A digital audio system with dither behaves like an analogue
system with a noise floor, where signals below the noise are
still audible.


It may well not be necessary to deliberately add dither. The
amount required for a 16 bit system is approximately 96dB below
peak level, and in many cases, the noise from the input
amplifier or the preceding analogue equipment will be enough.


DIGITAL TO ANALOGUE CONVERSION 


The output of a Digital to Analogue converter is a 'staircase'
signal:


[Figure: 'staircase' output signal; voltage against time]


The steps represent the high 'image' frequencies. To return to
the analogue signal we need only to filter these off. This is
done by the 'anti-image' or 'reconstruction' filter. It has a
similar response to the anti-alias filter:


[Figure: response of the 'anti-image' or 'reconstruction' filter;
amplitude against frequency (kHz)]


The result is the original audio signal.


It might be thought that the anti-image filter is unnecessary,
since the image frequencies are beyond the range of hearing.
However, if they are not removed they could result in wasted
amplifier power and damaged tweeters. If the signal is recorded
on an analogue tape machine the image frequencies could beat
with the bias signal to produce audible tones. Also, any
intermodulation distortion in following equipment could result
in audible frequencies being produced.


Sampling and quantising can be summarised as the two dimensions
of the conversion process:


[Figure: the two dimensions of the conversion process.
Quantising: the number of bits used determines the dynamic range
of the system. Sampling: the sampling rate determines the upper
audio frequency limit of the system.]


A block diagram of a digital audio system showing the main 
sections introduced so far is shown below: 


input
  |
ANTI-ALIAS FILTER
  |
ANALOGUE to DIGITAL CONVERTER
  |
DIGITAL SECTION (RECORDING, TRANSMISSION, SIGNAL PROCESSING etc.)
  |
DIGITAL to ANALOGUE CONVERTER
  |
ANTI-IMAGE FILTER
  |
output


4
Practical Problems


In order to produce a practical digital audio system, there are
a few problems that need to be considered. One cause of these
is the high number of bits per second, or bit-rate, of a digital
audio signal. With a sampling rate of 48kHz and using 16 bits
per sample the bit-rate will be:


48,000 x 16 = 768,000 bits per second


Handling or recording this large amount of information presents
some difficulties.


ERROR PROTECTION


If reasonable recording times are to be obtained, bits will have
to be packed very tightly on the tape. Any mechanical recording
system will suffer from dropouts caused by imperfections in the
medium and these will result in some replayed bits being wrongly
interpreted - i.e. a '1' interpreted as a '0' or vice versa.
Interference on a line or radio link can have the same effect.


The audibility of such an error depends on which bit in the
sample is affected. The sixteen bits in a sample do not all
have the same value - a '1' in the left-most position is 'worth'
32,768 whereas a '1' in the right-most position is only worth 1.
These two bits are known as the Most Significant Bit (MSB) and
Least Significant Bit (LSB) respectively:


MSB (worth 50%)
LSB (worth 0.0015%)


While an error in the LSB will be inaudible, an error in the MSB
will result in a sample being considerably larger or smaller
than its true value - this is clearly audible as a sharp click:


[Figure: a run of samples in which the correct value 01011110 is
replaced by the erroneous value 11011110, audible as a sharp
click]


A lot of errors together will sound similar to severe ignition
interference on a car radio.
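The difference between an error in the LSB and one in the MSB can
be put into numbers. This sketch treats the sample as a plain
unsigned 16-bit number, in line with the 'worth 32,768'
description in the text; the sample value and the function name
flip_bit are hypothetical.

```python
def flip_bit(sample, position, bits=16):
    """Invert one bit of an unsigned sample; position 0 is the LSB."""
    if not 0 <= position < bits:
        raise ValueError("bit position outside the word")
    return sample ^ (1 << position)


sample = 0b0101111000000000  # hypothetical 16-bit sample value
changes = {}
for name, position in (("LSB", 0), ("MSB", 15)):
    damaged = flip_bit(sample, position)
    changes[name] = abs(damaged - sample)
    # Size of the error, and the same as a percentage of the full range.
    print(name, changes[name], round(100 * changes[name] / 2 ** 16, 4))
```

A single wrong bit therefore shifts the sample by anything from
0.0015% to 50% of the full range, depending on where it falls.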


Since the audible effect of just a single bit in error can be so
severe, every digital audio recording or transmission system
needs some form of error protection. All error protection
systems involve the addition of extra bits to the digital audio
signal. These extra bits are calculated according to a code to
give the signal some special mathematical property. An example
of such a property might be that the numbers transmitted will
all be exactly divisible by one particular 'check' number.
Since they do not carry any audio information, the extra bits
are known as redundant bits (or just redundancy).


If the signal does not possess the special property when
replayed or received, the presence of an error has been
detected. Sophisticated systems can also indicate exactly which
bits are in error, in which case error correction can be carried
out. If correction is not possible, then steps are taken to
conceal the effect of the error. Error protection can be broken
down into three distinct stages:


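[Figure: the three stages of error protection: Detection, Correction, Concealment; one path is labelled '(simple systems)']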


The system must be designed so that errors are detected with
very high reliability since the following stages are useless if
an error passes undetected.


One of the simplest error protection systems is the parity
check. An extra bit is appended to each sample and its value
chosen to make the number of '1's in the combined word even:


sample contains an even number (8) of '1's - parity bit is 0

sample contains an odd number (9) of '1's - parity bit is 1


On replay or reception, the combined sample and parity word is
checked for the presence of an even number of '1's. If a bit is
in error, this will be detected by the presence of an odd number
of '1's. This simple system cannot indicate which is the
erroneous bit and so all that can be done is to conceal the
error.
Concealment is usually done by interpolation. This
involves making an estimate of the correct sample value by
taking the average of the good samples on either side.

Concealing an error inevitably means a small amount of
degradation of the signal; the quality is no longer 'sealed in'.
This would not be noticeable in isolated cases, but if
concealment was consistently necessary, audio quality would
eventually suffer.
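Concealment by interpolation can be sketched as follows. The function name `conceal` and the sample values are illustrative assumptions; the sketch also assumes that faulty samples are isolated and never fall at either end of the list.

```python
def conceal(samples, faulty):
    """Replace each flagged sample by the average of its two neighbours.

    `faulty` holds the positions flagged by the error detector. Flagged
    samples are assumed to be isolated and not at either end of the list.
    """
    out = list(samples)
    for i in faulty:
        out[i] = (samples[i - 1] + samples[i + 1]) // 2
    return out

good = [100, 200, 300, 400, 500]
received = [100, 200, 29000, 400, 500]   # sample 2 failed its parity check
print(conceal(received, {2}))
```

Here the estimate happens to equal the original value because the example signal rises in a straight line; with real programme material the estimate is only approximate, which is the small degradation referred to above.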


Note that, unfortunately, it may be only the parity bit itself
that is in error. However, the system has no way of knowing
this and so the sample must still be assumed faulty.


A parity check will fail to detect a double error, since if two
bits in the sample are changed, the number of '1's will stay
even. It will however detect a triple error, or any odd number
of errors. This illustrates an important aspect of all error
protection systems: there will always be a limit to the
acceptable number of errors beyond which the system will fail
to work reliably. This means that the designer of an error
protection system must know in advance the likely number of
errors to be generated by the particular medium. A parity check
would only be suitable for a system that was likely to generate
a maximum of one error per sample.
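The behaviour of the parity check with single, double and triple errors can be tried out with the short sketch below. The function names are hypothetical and the bit positions chosen for the errors are arbitrary.

```python
def parity_bit(sample: int) -> int:
    """Return the bit that makes the total number of '1's even."""
    return bin(sample).count("1") % 2

def add_parity(sample: int) -> int:
    """Append the parity bit to a 16 bit sample, giving a 17 bit word."""
    return (sample << 1) | parity_bit(sample)

def parity_ok(word: int) -> bool:
    """True if the combined word contains an even number of '1's."""
    return bin(word).count("1") % 2 == 0

word = add_parity(0b1010101010101010)   # eight '1's, so the parity bit is 0
single = word ^ (1 << 5)                # one bit in error
double = single ^ (1 << 9)              # two bits in error
triple = double ^ (1 << 12)             # three bits in error

for name, w in [("no error", word), ("single", single),
                ("double", double), ("triple", triple)]:
    print(name, "passes check" if parity_ok(w) else "error detected")
```

The double error passes the check unnoticed, which is the limit described above.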


If the system could indicate which bit was wrong, then error
correction could be performed. Since there are only two
possible values for a bit, if it is known that a bit is wrong
then the alternative value will be the correct one. One such
technique is to combine several parity checks in a
two-dimensional system.


Consider a 16 bit sample arranged in a square with parity bits
added to each row and each column. The 16 bits of the sample
plus the 9 parity bits are then transmitted or recorded as one
25 bit word:


On reception, the position of the error can be found by a
cross-reference of the row and column parity checks:

[Figure: square of sample bits with parity bits for each row and for each column; an error is detected in one row and in one column, and the position of the error is where they cross]


This system will always detect and correct a single bit error.
It will also detect all double errors but will not be able to
correct them. This illustrates a further important aspect of
error protection systems - they are always capable of detecting
the presence of a greater number of errors than they can
correct. The uncorrectable errors must then be concealed by
interpolation.
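A minimal sketch of the two-dimensional parity scheme is given below. It assumes the 16 bits are laid out as a 4 x 4 square, with the ninth parity bit in the corner checking the parity bits themselves; the function names and the example data are made up for illustration.

```python
def encode(bits):
    """bits: 4 rows of 4 bits. Returns a 5 x 5 grid (25 bits) in which
    every row and every column contains an even number of '1's."""
    grid = [row + [sum(row) % 2] for row in bits]        # add row parity
    grid.append([sum(col) % 2 for col in zip(*grid)])    # add column parity
    return grid

def correct(grid):
    """Check all rows and columns; invert a single wrong bit if one is
    located. Returns a short status string."""
    bad_rows = [r for r, row in enumerate(grid) if sum(row) % 2]
    bad_cols = [c for c, col in enumerate(zip(*grid)) if sum(col) % 2]
    if not bad_rows and not bad_cols:
        return "no error"
    if len(bad_rows) == 1 and len(bad_cols) == 1:
        grid[bad_rows[0]][bad_cols[0]] ^= 1   # only two values, so invert it
        return "corrected"
    return "detected, not correctable"

data = [[1, 0, 1, 1], [0, 0, 1, 0], [1, 1, 0, 0], [0, 1, 1, 1]]
grid = encode(data)
grid[2][1] ^= 1                 # a single bit error
print(correct(grid))            # the failed row and column checks cross at the error
grid[0][0] ^= 1
grid[3][2] ^= 1                 # a double error
print(correct(grid))
```

With a double error the checks show that something is wrong but no longer point to a single position, so the sample would have to be concealed.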


There is a price to be paid for error protection: in the example
above it is necessary to record or transmit over 50% more bits
than if there were no error protection. However, the presence
of an error protection system may allow the bits to be packed
even more tightly on to the medium, sometimes to the extent that
an increase in playing time is possible.


In practice, much more sophisticated systems than the one above
are used - for example the Reed-Solomon codes used in Compact
Disc and DAT. These achieve much better detection and
correction performance with fewer redundant bits. They involve
some serious mathematics.


Error Indication


Any professional digital audio system should provide some
indication of the operation of the error protection. At its
simplest this will consist of two lights - one to indicate the
detection and correction of an error and one to indicate when
correction has not been possible and concealment has been
needed:

[Figure: two indicator lights labelled 'Correction' and 'Concealment']


These lights provide a useful 'health check' of the system. In
normal operation the 'Correction' light will flash occasionally,
but the 'Concealment' light will not flash at all, or only very
rarely. Unusually frequent flashing of the correction light
will give advance warning of a problem before it becomes so bad
that concealment is needed. For example, a tape that was
wearing out could be copied before any damage was done to the
audio information.


INTERLEAVING 


All error protection systems have a limit to the number of
errors they can handle in a given time. Dropouts will tend to
create a lot of errors all at once. This could overwhelm the
error protection system. To get round this problem, a technique
known as interleaving is used in all digital recording systems,
and in some transmission systems. The principle is illustrated
below:

[Figure: (a) samples to be recorded, (b) re-ordered samples on tape with a dropout, (c) samples restored to the correct order - dropout becomes four single errors]


Samples to be recorded (a) are re-ordered according to a
predetermined pattern before being recorded on tape (b). When a
dropout occurs it will tend to affect several adjacent samples;
however, when they are rearranged in the correct order (c), the
dropout becomes four isolated errors and the error protection
system is able to cope. In practice, much greater spreads over
hundreds or thousands of samples are involved.


The type of interleave illustrated above is known as a block
interleave and is used in the DAT and PCM 1610/30 systems. In
this method, a group of samples is re-ordered and then recorded
as a block. The re-ordering is done by arranging the samples
row by row in a memory chip and then recording them column by
column as illustrated below:


[Figure: samples numbered 1, 2, 3 ... written into memory row by row and recorded in column order; first block, second block]
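The row-and-column re-ordering can be sketched in a few lines. The 4 x 4 block size and the function names are assumptions chosen to keep the example small; as noted above, real systems spread the samples much more widely.

```python
def interleave(block, rows, cols):
    """Write samples into memory row by row, read them out column by column."""
    assert len(block) == rows * cols
    return [block[r * cols + c] for c in range(cols) for r in range(rows)]

def deinterleave(block, rows, cols):
    """Undo interleave(): restore the original sample order on replay."""
    return [block[c * rows + r] for r in range(rows) for c in range(cols)]

samples = list(range(1, 17))           # samples 1 to 16 as one 4 x 4 block
on_tape = interleave(samples, 4, 4)    # recorded order: 1, 5, 9, 13, 2, 6, ...

damaged = list(on_tape)
damaged[4:8] = [None] * 4              # a dropout destroys four adjacent samples
replayed = deinterleave(damaged, 4, 4)
print(on_tape)
print(replayed)
```

After de-interleaving, each missing sample has good samples on either side, so the dropout has become four isolated errors.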


Interleaving has important implications for editing. It will
often not be possible to edit the tape at the desired point
without leaving some samples 'stranded' on the wrong side of the
cut. In the above illustration, if the desired edit point is,
say, between samples 8 and 9, it is not possible to find a point
on the interleaved recording where the cut can be made.
Solutions to this problem are discussed in the next chapter.


BANDWIDTH OF A DIGITAL SIGNAL 


A further practical problem concerns the frequencies present in
a digital signal (these are not related in any way to the audio
frequencies). Consider the signal representing the 16 bit
sample 1010101010101010:


This sample produces the highest frequency in the digital
signal. With 48kHz sampling rate this will be:

8 x 48 kHz = 384 kHz


Any recording or transmission medium will clearly need to be
able to handle high frequencies. At the other extreme, consider
the signal produced by the 16 bit sample 0000000000000000:


This has zero frequency - in other words, DC. The frequency
range, or bandwidth, of a digital audio signal is therefore from
DC to 384 kHz. Such a wide frequency range causes problems,
particularly in magnetic recording where it cannot be recorded
directly.
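The 384 kHz figure follows from the fact that each '10' pair in the alternating pattern makes one complete cycle, so a 16 bit sample contains eight cycles. A small sketch of the arithmetic (the function name is hypothetical, and a single channel with no redundant bits is assumed):

```python
def highest_frequency_hz(sample_rate_hz: float, bits_per_sample: int) -> float:
    """Frequency of an alternating 1010... pattern: one cycle per pair of bits."""
    bit_rate = sample_rate_hz * bits_per_sample
    return bit_rate / 2

print(highest_frequency_hz(48_000, 16) / 1000, "kHz")
```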


THE NEED FOR CLOCK SIGNALS


Another, related problem occurs when the digital signal contains
long strings of '1's or '0's.

When this happens the system needs some way of counting how many
'1's or '0's there are. To do this it needs a clock signal:

[Figure: clock signal]


The clock signal consists simply of one pulse per bit. It
allows the system to keep track - in much the same way as a
metronome keeps time for a musician. Clock signals are an
essential part of every digital system. Within a piece of
equipment the clock will be a separate signal on its own wire
but when the digital signal is recorded or transmitted it is
usually uneconomic and technically unsatisfactory to record or
transmit the clock separately. It must be combined in some way
with the digital signal.


CHANNEL CODES 


The two problems of wide bandwidth and clock requirement are
overcome by the use of a channel code - the 'channel' being the
recording or transmission medium.


The role of the channel code is to combine the digital audio
signal with the clock signal to produce the signal that is
recorded or transmitted:

[Figure: Digital Audio and Clock combined to give the signal recorded or transmitted]


The combined signal must suit the particular medium - usually
this will mean removing any DC content and reducing the ratio of
lowest to highest frequency. So, while CD, R-DAT, DASH etc. are
all capable of recording the same digital audio signal, they all
use different channel codes designed to work best with each
medium.


One of the simplest channel codes is known as biphase mark.
This is a form of frequency modulation:

[Figure: waveforms of the Digital Audio signal, the Clock and the resulting Biphase mark signal]


Note that there is always a transition between every bit - this
makes regenerating a clock signal easy. '1's are indicated by
an extra transition in the middle of the bit. There is no DC in
the signal. The price to be paid for this code is an overall
increase in the highest frequency present.
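The rules of biphase mark can be expressed as a short sketch. Each bit is represented here by two half-bit levels; the function names, the starting level and the example bit pattern are all illustrative assumptions.

```python
def biphase_mark(bits, level=0):
    """Encode bits as a list of half-bit levels (two entries per bit)."""
    out = []
    for bit in bits:
        level ^= 1              # transition at the start of every bit
        out.append(level)
        if bit:
            level ^= 1          # extra transition in the middle of a '1'
        out.append(level)
    return out

def decode(halves):
    """A bit is '1' if the two halves of its bit period differ."""
    return [int(a != b) for a, b in zip(halves[0::2], halves[1::2])]

bits = [1, 0, 0, 1, 1, 0, 1, 0]
signal = biphase_mark(bits)
print(signal)
print(decode(signal) == bits)
```

Because only the transitions carry information, the decoder gives the same result if the whole signal is inverted. The signal never stays at one level for longer than one bit period, which is why it contains no DC.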


Biphase mark code is used for EBU/SMPTE timecode since it
enables the timecode to pass through audio circuits and be
recorded on audio tape tracks. It is also used for the AES/EBU
digital audio interface.


There are many other more sophisticated codes used in digital
recording systems. Examples are the Eight-to-Fourteen
Modulation (EFM) used in Compact Disc, and the 8/10 code of DAT.


Two (or more) digital audio signals can be easily combined
together and sent down one channel. This involves sending
samples from each signal in turn - e.g. alternate left and right
samples in the case of a stereo signal:


As always, there is a price to be paid: the time allowed for
each sample is now halved which means that the channel must be
able to handle twice as high a frequency as for a mono signal.

This technique is known as multiplexing, or more specifically
time division multiplexing (TDM).
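Time division multiplexing of a stereo signal can be sketched as below. The function names and sample values are hypothetical.

```python
def multiplex(left, right):
    """Send left and right samples alternately down one channel."""
    channel = []
    for l, r in zip(left, right):
        channel.extend([l, r])
    return channel

def demultiplex(channel):
    """Split the alternating stream back into left and right signals."""
    return channel[0::2], channel[1::2]

left = [10, 11, 12, 13]
right = [20, 21, 22, 23]
channel = multiplex(left, right)
print(channel)
print(demultiplex(channel))
```

The channel carries twice as many samples in the same time as either signal alone, which is the doubling of frequency mentioned above.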


When a stereo signal is multiplexed, there is a choice to be
made concerning the timing of the sampling process. One
possible method is to sample the left and right channels at the
same instant and then delay the right channel samples until it
is their turn to be sent down the channel. This is known as
coincident sampling:

[Figure: coincident sampling of Left and Right channels]


The other approach is to sample left and right channels
alternately. The samples are then already in the correct order
for multiplexing. This is known as alternate sampling:

[Figure: alternate sampling of Left and Right channels]


There is no particular advantage to either system, provided that
the Digital to Analogue converter uses the same technique as the
original Analogue to Digital converter. If this is not the
case, the result is a delay of half a sampling period between
left and right channels. This amounts to 11.34 µs at a 44.1 kHz
sampling rate and is not serious when listening in stereo - it
represents an error in positioning the loudspeakers, or your
head, of less than 4mm. However, when the signals are summed to
mono it results in a slight loss of HF - amounting to 1.3 dB at
15 kHz and increasing to 2.4 dB at 20 kHz.
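These figures can be reproduced with a short calculation. When two identical signals are added with one delayed by a time t, the level at frequency f is reduced by a factor cos(pi x f x t) compared with adding them in step. The sketch below uses this relationship; the function name and the speed of sound (taken as 340 m/s) are assumptions.

```python
import math

def mono_sum_loss_db(freq_hz: float, delay_s: float) -> float:
    """Loss, in dB, when two identical signals are summed with one delayed."""
    return -20 * math.log10(abs(math.cos(math.pi * freq_hz * delay_s)))

half_period = 0.5 / 44_100            # half a sampling period at 44.1 kHz
SPEED_OF_SOUND = 340.0                # metres per second (assumed value)

print(f"delay: {half_period * 1e6:.2f} microseconds")
print(f"path difference: {SPEED_OF_SOUND * half_period * 1000:.1f} mm")
for f in (15_000, 20_000):
    print(f"{f // 1000} kHz: {mono_sum_loss_db(f, half_period):.1f} dB loss")
```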


With the increasing digital interconnection of audio equipment,
there is a need for a standard. All modern systems use
coincident sampling. Some older systems, notably F1, use
alternate sampling. A coincident recording can be made on an F1
system by using the CTC (Coincident Time Correction) facility on
the Propak interface made by Audio+Design. This inserts an
11.34 µs delay into the right channel. Such a recording would
then be suitable for making a CD master.


Error protection is needed in recording or transmission systems.

There are three stages to error protection:

Detection - Correction - Concealment

Correction will restore a perfect signal, whereas concealment
causes a small amount of degradation.

Interleaving is used to assist the error protection system.

Channel codes combine clock and digital audio signals to produce
a signal suitable for the particular medium being used.

Multiplexed stereo signals can use either Coincident or
Alternate sampling. The same technique must be used throughout
a digital system.


5
Digital Recording Systems


There is an ever-increasing number of different digital audio
recording systems available and it seems unlikely that this rate
will slow down in future. This inevitably causes problems with
exchange of recordings. Fortunately, with digital audio it is
always possible to copy the recording to a different medium
without degradation.


The many different systems can be broadly categorised as
follows:

[Figure: chart of recording systems: Pseudo-Video, DAT, PD, DASH, Compact Disc, WORM, 'Audiofile' etc.]


All these systems have many basic concepts in common. In the
previous chapter, we have already introduced those of:

Error protection
Interleaving
Channel coding


This chapter will cover the further general topics of wow &
flutter, subcodes, pre-emphasis, monitoring & metering and
editing. Specific systems will then be discussed in more
detail.


It is possible to draw a general picture of a digital recorder
using the sections we have already introduced:

[Figure: general block diagram of a digital recorder: input (i/p) and MEDIUM (Tape, Disk etc.)]


The physical medium and the means of recording on to it are the
only essential differences between systems - although the
channel code and the error protection system must be designed to
suit the particular medium. Whatever the medium, they must all
be capable of recording the high frequencies present in the
digital signal.


[Figure labels: Anti-alias filter, ADC, Interleave and add error protection, Channel code, Separate data & clock, De-interleave & error correct, DAC, Anti-image filter, output (o/p)]

ELIMINATING WOW & FLUTTER 


One specification common to all digital machines is effectively
zero wow & flutter. How is this achieved?


Any mechanical transport, whether analogue or digital, will
suffer from speed fluctuations. The advantage of a digital
signal is that we know exactly the rate at which the samples
should be reproduced. It is therefore possible to correct the
timing errors introduced by the transport. What is required is
a memory that can take in samples at the approximate rate from
the tape or disc and then send them out again at the precisely
correct intervals, under control of a crystal oscillator. The
operation of this buffer memory can be likened to a tank of
water:

[Figure: buffer memory likened to a tank of water; samples leave at a constant rate]


The buffer memory must be able to store enough samples to allow
the worst speed fluctuations to be ironed out, without the
'tank' either overflowing or running dry. A typical capacity
for this memory in a CD player would be about 500 samples, or
11ms duration.
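The action of the buffer memory can be imitated with a simple simulation. This is a sketch under stated assumptions, not a model of any real machine: the transport speed error is taken to be a slow sine wave, the buffer starts half full, and the names `replay`, `wow_depth` and `wow_period` are invented for the example.

```python
import math
from collections import deque

def replay(samples, capacity=500, wow_depth=100, wow_period=2000):
    """Pass samples through a buffer memory; return (output, peak_fill)."""
    buffer = deque()
    output = []
    delivered = 0        # samples read off the medium so far
    peak_fill = 0
    tick = 0             # one tick of the crystal clock per output sample
    while len(output) < len(samples):
        # Samples the transport has delivered by this tick: the nominal
        # rate, plus a slow sinusoidal speed error, plus a half-full start.
        wow = wow_depth * math.sin(2 * math.pi * tick / wow_period)
        target = min(len(samples), int(capacity // 2 + tick + wow))
        while delivered < target:
            buffer.append(samples[delivered])
            delivered += 1
        peak_fill = max(peak_fill, len(buffer))
        output.append(buffer.popleft())   # exactly one sample out per tick
        tick += 1
    return output, peak_fill

samples = list(range(10_000))
output, peak_fill = replay(samples)
print(output == samples, peak_fill)
print(f"500 samples at 44.1 kHz last {500 / 44_100 * 1000:.1f} ms")
```

The samples arrive irregularly but leave at one per clock tick, and with these settings the 'tank' neither overflows nor runs dry.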


A buffer memory is also needed to undo the interleaving used on
the recording and the same memory can perform both tasks.


SUBCODES


Since the digital audio signal is simply a stream of numbers, it
is relatively easy to record some additional numbers along with
the digital audio. These additional numbers are known as a
subcode. Virtually all digital recording systems have a
subcode, even if only of limited capacity.


At their simplest, subcodes are used to carry information about
the signal such as the sampling rate, or whether pre-emphasis
was used. More advanced systems can use the subcode to carry
timecodes or text, such as the title of the recording. Some
systems have considerable subcode capacity: for example, on a
Compact Disc there is one byte (eight bits) for every six pairs
of audio samples - equivalent to roughly two typed sheets of A4
paper every second or a still-frame of video every minute.
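The subcode capacity of a Compact Disc follows directly from the figures just given, taking the CD sampling rate of 44.1 kHz. The constant names in this sketch are made up for illustration.

```python
SAMPLE_PAIRS_PER_SECOND = 44_100   # stereo sample pairs per second on CD
PAIRS_PER_SUBCODE_BYTE = 6         # one subcode byte for every six pairs

subcode_bytes_per_second = SAMPLE_PAIRS_PER_SECOND // PAIRS_PER_SUBCODE_BYTE
subcode_bytes_per_minute = subcode_bytes_per_second * 60

print(subcode_bytes_per_second, "bytes per second")
print(subcode_bytes_per_minute, "bytes per minute")
```

At one byte per typed character, a rate of several thousand bytes per second is in line with the comparison with typed sheets made above.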


PRE-EMPHASIS


In most systems, high frequency pre-emphasis can be applied to
the audio signal before the Analogue to Digital converter. This
gives a better signal to noise ratio - at the expense of a lower
overall level if the signal contains high levels of HF.
Virtually all recording systems use the same standard response
for this pre-emphasis:

[Figure: standard pre-emphasis response, with the matching de-emphasis curve]


An equal but opposite HF cut (de-emphasis) is applied after the
Digital to Analogue converter to restore a flat overall frequency
response.


If pre-emphasis is applied, this fact must be recorded on the
tape, usually by the value of a single bit in the subcode, so that
the machine can correctly apply de-emphasis on replay. The
pre-emphasis 'status' of the signal must also be passed on to
any copy of the recording.


While most recording systems provide for pre-emphasis, it seems
to be little used on commercially available CDs at present.
Notable exceptions are recordings made on the PCM-F1 or 701
system, which has pre-emphasis applied permanently. Several of
the BBC FX CDs were recorded using this system.


Digital signals with pre-emphasis can cause problems if it is
required to mix or edit them in digital form with other signals
that do not have pre-emphasis. The pre-emphasis must first be
removed and a digital filter (see Chap 6) is necessary if this
is to be done without first returning the signal to the analogue
state.


MONITORING 


In a digital recorder, the quality of the recording is
determined by the Analogue to Digital converter - this is where
any distortion will take place. Provided the system is working
correctly, the tape plays no part in determining audio quality.
This is contrary to analogue recording where most of the
distortion is caused by the tape itself.


To some extent, this lessens the need for off-tape monitoring
with digital recorders - all that is necessary to judge the
quality of the recording is to monitor the signal fed through
the two converter stages. A typical arrangement on a machine
without off-tape monitoring, such as an ordinary DAT recorder,
would be:

[Figure: monitoring arrangement without off-tape monitoring, with bargraph meter]


A possible advantage of this system is the absence of any
significant delay to the monitored signal. It does not, though,
provide the confidence that comes from off-tape monitoring.


On a machine with the necessary extra head(s) for off-tape
monitoring, a typical set-up would be:

[Figure: off-tape monitoring arrangement: record circuits and REC head, PB head and play circuits, with meter]


This will give the familiar delay associated with off-tape
monitoring.


In practice, the most important cause of distortion that needs
to be monitored for is overload. The behaviour of digital
recorders when overloaded is completely different to that of
analogue machines. In an analogue recorder, overload is a
gradual process producing increasing amounts of distortion as
the input level is increased. The usable peak recording level
is determined by the amount of distortion that is acceptable -
for example peak level is often taken to be the point where 3%
third harmonic distortion is produced.


In a digital recorder - or any digital audio system - overload
takes place suddenly at a precisely defined point: when the
maximum count allowed by the number of bits in the system is
reached. When 1111111111111111 is reached the signal can go no
further and is clipped, producing large amounts of distortion:


This would be very clearly audible on, for example, piano. On
the other hand, pop and rock music can often be quite severely
clipped before it becomes noticeable.


This sudden onset of clipping is easily audible on tone and
provides a very useful reference level which can be found by ear
to within a fraction of a dB without the use of line-up tapes or
meters.


Meters and Overload Indicators


The ideal meter for digital audio is also somewhat different to
that used for analogue recording. On conventional analogue
meters, whether PPM or VU, there is a portion of the scale above
peak level (PPM 6 or 0 VU). In a digital system it is not
possible for the signal to go beyond peak level and it is
meaningless to provide any room for this on the scale. Peak
level is therefore at full scale.


Since a digital meter cannot indicate that an overload has taken
place, it is usual to provide an additional overload indicator.
This detects when the digital sample value is equal to the peak
('all ones') value. However, it is possible for a single sample
of a signal that just reaches peak to have this value without
being clipped. Also, the quantising process means that several
samples at the peak of a low frequency signal may genuinely have
peak value. Most systems, therefore, do not indicate overload
until several (for example, four) consecutive samples have
equalled peak value.
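The logic of such an overload indicator can be sketched as follows. For simplicity the samples are treated as unsigned 16 bit counts with 'all ones' as the peak, as in the description above; the function name and the threshold of four samples are illustrative assumptions.

```python
PEAK = 0xFFFF   # 'all ones' in a 16 bit system

def overloaded(samples, run_length=4, peak=PEAK):
    """True if run_length or more consecutive samples equal the peak value."""
    run = 0
    for s in samples:
        run = run + 1 if s == peak else 0
        if run >= run_length:
            return True
    return False

just_touching = [60_000, PEAK, 60_000]                    # one sample at peak
clipped = [60_000, PEAK, PEAK, PEAK, PEAK, PEAK, 60_000]  # flat-topped waveform
print(overloaded(just_touching), overloaded(clipped))
```

A signal that only just touches peak does not light the indicator, while a flat-topped run of peak-value samples does.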


Meters on digital machines are typically of the bargraph type
with a true peak response. A PPM, despite its name, has a
reduced response to short duration peaks - for example, a 5ms
peak-level transient will under-read by 4dB. Depending on the
nature of the programme material, a PPM will tend to indicate
lower than a digital meter. An extra 6dB of headroom (i.e. a
digital peak level of +14dBu) seems in practice to be enough to
allow for this. A greater allowance may be needed if
pre-emphasis is used on the recorder.


EDITING


Although conventional 'razor blade' editing is possible with
DASH and ProDigi open reel recorders, in general the editing of
digital recordings will be carried out by dubbing - a process
similar to video editing.


With a digital recording it is not possible to simply drop into
record at the desired edit point and record new material. This
is for two reasons:

1) The samples on the tape will have been interleaved -
   this means it is not possible to identify a point on
   the tape which represents the edit point.

2) In order to produce the equivalent of the angled cut
   used in analogue editing to avoid clicks, a short
   cross-fade must be performed at the edit point.


These two problems are overcome by the use of an advance replay
head.

The advance replay head is positioned such that, with the
cross-fader in the 'tape' position, the time taken to
de-interleave the signal off tape and interleave it again is
equal to the time taken for the tape to move from the advance
replay head to the record head.


The sequence of events required to perform an edit is as
follows:

1) The cross-fader is initially in the 'tape' position.
   The existing material is replayed from a point ahead
   of the desired edit point. After a short time (about
   100ms) the signal at the record head will be
   identical to that already on the tape.

2) The machine now switches into record, re-recording
   what is already on the tape. This can be monitored
   at the machine output.

3) The actual edit is performed at the required point by
   cross-fading to the input signal. This may be live
   material - as in a 'drop-in' - or recorded material
   from another machine.

4) If it is required to 'drop-out' back to the old
   material, this is done by cross-fading back to the
   signal from the advance replay head. After a short
   while the record head will be re-recording what is
   already on the tape. The machine can now switch
   out of record.


The edit can be rehearsed by following the above sequence but
without putting the machine into record. The duration of the
cross-fade is typically about 10ms (a 45 degree cut on a 15ips
stereo tape gives a cross-fade of about 7ms). On some systems
the duration can be altered to suit the programme material -
equivalent to a variable angle of cut with a razor blade.
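The cross-fade at the edit point can be sketched as below. A straight-line (linear) fade is assumed here purely for illustration, since the shape of the fade is not specified; the function name is hypothetical, and the old and new material are assumed to be lists of the same length that are already lined up in time.

```python
def cross_fade_edit(old, new, edit_point, fade_len):
    """Join old[:edit_point] to the new material with a linear cross-fade
    lasting fade_len samples, starting at edit_point."""
    out = list(old[:edit_point])
    for i in range(fade_len):
        gain_new = (i + 1) / (fade_len + 1)     # rises steadily towards 1
        n = edit_point + i
        out.append((1 - gain_new) * old[n] + gain_new * new[n])
    out.extend(new[edit_point + fade_len:])
    return out

SAMPLE_RATE = 44_100
fade_len = round(0.010 * SAMPLE_RATE)    # a 10ms cross-fade in samples

old = [1.0] * 1000                       # stand-in for the existing material
new = [0.0] * 1000                       # stand-in for the incoming material
edited = cross_fade_edit(old, new, 100, fade_len)
print(fade_len, edited[99], edited[-1])
```

Changing `fade_len` corresponds to altering the angle of the cut.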


To produce a practical editing system, some form of edit 
controller is necessary to enable accurate location of edit 
points (with a 'search wheel' similar to that found on 
professional CD players) and to automatically carry out the 
correct editing sequence. 


PSEUDO-VIDEO SYSTEM


Early digital audio recording systems made use of video tape
recorders (VTRs), as these were the only readily available
machines that could record the necessary high frequencies. In
order to record high frequencies, high tape speeds are
necessary. The only practical way to achieve these high speeds,
without excessive tape consumption, is to use a rotary head.


To use a VTR to record digital audio it is necessary to
'package' the digital audio samples into a standard video
signal. In effect, '1's and '0's are represented by white and
black dots respectively. A complete recording system will
consist of two units - the VTR and a 'PCM adaptor':


[Figure: PCM adaptor connected to the VTR by a video signal]


The 'PCM adaptor' contains the ADCs and DACs plus the error
protection circuits and formats the digital audio into a video
signal.


The two most common pseudo-video systems are the Sony 1610
series, of which the 1630 is the current model, and the Sony
PCM-F1:


The Sony 1610 series is designed for use with U-matic format
machines and is the system used for Compact Disc mastering. The
1610 systems only work with machines of the 525/60 (American)
video standard in order to allow international exchange of
recordings. As with CD, they use 44.1kHz sampling rate and 16
bit quantising. Coincident sampling is used.


The Sony PCM-F1 (including the PCM 501/601/701 units) was
designed as a domestic system using Betamax machines and
conforming to the EIAJ standard (Electronic Industries
Association of Japan). It exists in both 525/60 and 625/50
video standards. It also uses 44.1kHz sampling rate, with the
option of either 14 or 16 bit quantising. The EIAJ standard
specifies 14 bit quantising but the PCM-F1 can also record 16
bit signals at the expense of reduced error protection ability.
Alternate sampling is used: if a PCM-F1 recording is to be
transferred digitally to another format that expects coincident
sampling, a timing correction device such as the 'Propak'
should be used during the initial recording (see Chap 4). Sony
naturally specified the use of Betamax machines for the PCM-F1
but VHS, or any other machine of equal or better performance,
can also be used.


Copying of pseudo-video recordings should be done via the PCM
adaptor, using the 'copy' output. This allows the error
protection system to 'clean up' the signal between generations.
If tapes are copied as simple video dubs, errors will accumulate
and the protection system will be unable to correct them.


Off-tape monitoring is not generally possible with pseudo-video
systems. However, Sony have produced a specially modified
U-matic machine with an extra pair of heads to enable off-tape
monitoring using the 1630 system.


DAT 


The use of video recorders to record digital audio is wasteful
since their performance, particularly signal to noise ratio, is
much better than is necessary. By designing a rotary head
system specifically for digital audio, it is possible to
considerably reduce tape consumption.


DAT borrows only the basic rotary head idea from video
recording, but uses considerably simplified mechanics. This is
achieved mainly by wrapping the tape round the head drum for
only 90 degrees:


[Figure: tape wrapped round the HEAD drum for 90 degrees]


Note that there is no erase head; a new recording is simply
recorded over the old one. The drum rotates at 2000 rpm
(compared to 1500 rpm for a video cassette machine). There are
two heads which lay down diagonal tracks at an angle of 6
degrees to the edge of the tape.


Since there are only two heads on the drum and the tape is
wrapped round it for only 90 degrees, there will be no head in
contact with the tape for half the time. This means that the
recording will be made in bursts, with each track lasting 7.5ms
but containing 15ms worth of audio.
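

The 7.5ms and 15ms figures follow directly from the drum speed,
the number of heads and the angle of wrap. A minimal Python
sketch of the arithmetic (the function name is hypothetical):

```python
def dat_track_timing(drum_rpm=2000.0, heads=2, wrap_deg=90.0):
    """Return (track duration, audio per track, head contact fraction)."""
    revolution_ms = 60_000.0 / drum_rpm             # one drum turn
    contact_ms = revolution_ms * wrap_deg / 360.0   # time one head touches the tape
    audio_ms = revolution_ms / heads                # each head writes once per turn
    contact_fraction = heads * contact_ms / revolution_ms
    return contact_ms, audio_ms, contact_fraction

contact_ms, audio_ms, fraction = dat_track_timing()
print(contact_ms, audio_ms, fraction)
```

With the default values each track lasts 7.5ms, carries 15ms of
audio, and a head is in contact with the tape for half the time.
A smaller drum with a greater angle of wrap changes the contact
time but not the amount of audio per track.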


The DAT standard does not specify a particular drum size or
angle of wrap, merely the track pattern produced. Thus it is
possible to use a smaller drum with a greater angle of wrap to
produce a more compact machine.


The tape speed is only 8.15mm/s and the tracks recorded on tape
are 13µm wide (a human hair is roughly 50µm diameter).
Recording tracks this narrow is easy compared to the problem of
finding them again on replay. To make this possible, Automatic
Track Finding (ATF) signals are included in each track. The
pattern of tracks on the tape is as follows.


There are three distinct areas in each track. Digital audio is
recorded in the centre section. Either side of this are the ATF
signals. Subcode is recorded at the ends of each track. The
linear tracks are primarily to protect against edge damage and
are of little use at such a low tape speed. The operation of
the ATF system is as follows:


The ATF signals have different frequencies and timing in each
adjacent track. If the head starts to stray off course, it will
pick up some of the ATF signal from an adjacent track. From
this, the machine can tell in which direction the head has
strayed and can adjust the capstan speed to bring the head back
onto the correct track.


The DAT standard allows for the recording of 16 bit audio
signals at any of the three standard sampling rates. Provision
is also made for long play and four-channel modes, each using
12 bit quantising and 32kHz sampling rate. Ordinary domestic
machines have so far not been able to record at 44.1kHz in an
attempt to prevent direct digital copying of CDs. They can
however play back pre-recorded tapes at 44.1kHz.


Off-tape monitoring can be provided on DAT machines by fitting a
second pair of heads on the drum. They can be conveniently
arranged at 90 degrees to the first heads. This second pair are
also displaced along the axis of the drum so that they replay a
few tracks behind the record heads. This delay is equal to the
decode/encode delay so that the machine can be used for editing,
in the process already described. For editing, the heads swap
functions so that the first pair are used for advance replay and
the second pair become the record heads. Off-tape monitoring
will not be possible during editing operations.


DAT has an even greater subcode capacity than CD. The subcode
carries time information and markers, known as IDs, to enable
the machine to find sections of a recording. Since the subcode
is recorded in a separate area of tape, it is possible to record
subcode information independently of the audio. The head will
switch into record for the subcode section of each track and
back to play for the audio section - all while rotating at
2000rpm!


DAT timecode 


The subcode can also be used to record SMPTE/EBU timecode. The
first machine to be able to do this was the Fostex D-20. This
appeared before any standard had been agreed between all
manufacturers. There is now a proposed IEC standard which is
unfortunately different to the Fostex system. The essential
difference between the two is that the Fostex behaves in a
similar way to a centre-track 1/4" machine, recording what is
fed into it. The proposed new standard will convert the input
timecode, of any frame rate, into a special DAT timecode. On
replay this is converted back to timecode of the appropriate
frame rate. It would also be possible to convert it to any of
the other frame rates. This might have advantages for
international exchange of programmes.


AUDIO IN DIGITAL VIDEOTAPE 


The D-1 (component) and D-2 (composite) digital video formats
each provide for recording four independent audio channels using
48kHz sampling rate and up to 20 bits/sample. The digital audio
signals are recorded using the same heads as the video but in
separate areas of the track. The audio tracks can therefore be
recorded and edited independently of the video. Each audio
sample is recorded twice in different areas of the tape to give
maximum error protection.


STATIONARY HEAD SYSTEMS 


There are two similar but incompatible 'families' of stationary
head, open reel, digital audio recorders. These are DASH
(Digital Audio Stationary Head) made by Sony and Studer, and
ProDigi (or PD) from Mitsubishi and Otari.


Each of these is a set of specifications for a variety of
possible machines using different tape widths and speeds to
provide anything from 2 to 48 channels. Only some of the
possible variants have actually been manufactured in each case.


Both systems have been designed to allow 'razor blade' editing -
indeed this is the main reason for an open reel format. If
razor blade editing is not required, it is much more
satisfactory to keep the tape protected inside a cassette.
(However, until recent advances in rotary head recording, an
open reel, stationary head system was also the only practical
way of producing a digital multitrack machine.)


A high tape speed is necessary to record a digital audio signal
and the DASH and PD multitrack machines run at 30ips. This is
compensated for by using double-play tape, since print-through
is of no concern in digital recording. It is possible to reduce
the necessary tape speed by spreading the bits from one audio
channel over several tracks on the tape:


[Figure: bits distributed across several tracks to reduce tape
speed, with a timecode track alongside]


This technique results in a proportionate reduction in tape
speed. For example, a stereo DASH machine uses four tape tracks
for each channel and runs the tape at quarter-speed (7 1/2 ips).
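

The idea can be sketched in a few lines of Python. This is a
simplified illustration: the round-robin allocation of bits to
tracks and the function names are assumptions, and the real DASH
and PD formats lay the data out in their own ways.

```python
def spread_bits(bits, n_tracks):
    """Share a channel's bits between n_tracks tape tracks in rotation."""
    return [bits[t::n_tracks] for t in range(n_tracks)]

def tape_speed_ips(tracks_per_channel, single_track_speed_ips=30.0):
    """Tape speed needed when each track carries 1/n of the bits."""
    return single_track_speed_ips / tracks_per_channel

# Eight successive bits (numbered here for clarity) over four tracks.
print(spread_bits([0, 1, 2, 3, 4, 5, 6, 7], 4))
# Four tracks per channel: quarter speed.
print(tape_speed_ips(4))
```

Each track only has to carry a quarter of the bits, so the tape
can run at a quarter of the speed for the same density of
recording along each track.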


It is not possible to replay a digital recording unless the tape
is running at close to the correct speed. To enable edit points
to be located by manually rocking the spools, a low quality
analogue recording is made alongside the digital tracks. Other
tracks are also provided for timecode and data recording.


When editing a digital tape, it is not possible to use an angled
cut to avoid clicks - this is meaningless in the context of a
digital signal. Instead, the tape is cut at right-angles and
the machine must perform a short cross-fade at the cut point.
This is achieved as follows:


On the tape, odd-numbered samples are separated from their
even-numbered partners by about 1 1/2". As a result, for a
short period either side of the edit, half the samples from
after the 'out' point and half the samples from before the 'in'
point will be available:


[Figure: tape around the EDIT POINT. EVEN samples on one side
belong with ODD samples after the edit point; ODD samples on the
other side belong with EVEN samples before the edit point. The
CROSS-FADE takes place over this region.]

Interpolation is used to recreate these signals and a cross-fade
can then be performed. This inevitably means some reduction in
quality at edit points.
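

A rough Python sketch of the process described above. It assumes
the simplest possible interpolation (the mean of the two
neighbouring samples) and a linear cross-fade; actual machines
may use more elaborate methods, and the function names are
hypothetical. Missing samples are marked with None.

```python
def interpolate(samples):
    """Replace each missing (None) sample with the mean of its neighbours."""
    out = list(samples)
    for i, s in enumerate(samples):
        if s is None:
            neighbours = []
            if i > 0 and samples[i - 1] is not None:
                neighbours.append(samples[i - 1])
            if i + 1 < len(samples) and samples[i + 1] is not None:
                neighbours.append(samples[i + 1])
            out[i] = sum(neighbours) / len(neighbours)
    return out

def splice(old_half, new_half):
    """Rebuild both half-rate signals, then cross-fade old into new."""
    old, new = interpolate(old_half), interpolate(new_half)
    n = len(old)
    return [o * (1 - i / (n - 1)) + w * (i / (n - 1))
            for i, (o, w) in enumerate(zip(old, new))]

# Only the even samples of the old material survive near the cut...
print(interpolate([0, None, 4, None, 8]))
# ...and only the odd samples of the new material.
print(splice([4, None, 4], [None, 0, None]))
```

The interpolated samples are only estimates of the originals,
which is the source of the reduction in quality at edit points.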


A version of DASH, known as Twin DASH, avoids this degradation
by recording the signal twice. The odd-even separation is
reversed on the second recording. This means that all the
samples are available for the cross-fade and no degradation
results. Twin DASH uses only two tape tracks for each copy of
each audio channel and therefore needs twice the tape speed of
ordinary DASH.


COMPACT DISC


The recording on a Compact Disc is in the form of a continuous
spiral track of pits pressed into the disc from the label side.
This surface of the disc is then coated with a reflective layer
of aluminium followed by a thin layer of protective lacquer.
The label is printed on this lacquer layer. From the playing
side the pits appear as bumps.


The signal is conveyed by variations in the length of both the
bumps and the spaces between them. The recording is replayed
using a low-power laser beam to detect the bumps. This beam
passes through the thickness of the disc and is focussed on the
reflective layer:


[Figure: cross-section of the disc showing the label side, the
protective coating (30µm) and dust on the playing side]


The beam enters the disc as an out-of-focus spot about 0.7mm in
diameter. This provides considerable immunity to dirt and
scratches on the playing side. Any obstruction needs to be
relatively large before it will obscure the laser beam. This is
akin to looking through a window at a distant object: dirt and
scratches on the window are out of focus and therefore not
visible.


Small scratches on the playing side can successfully be removed
by polishing with a mild abrasive such as T-Cut. In contrast,
the label side is much more vulnerable to scratches since the
reflective layer is protected by a lacquer coating only 30µm
thick. A scratch that went through this lacquer would destroy a
large amount of audio information.


The laser beam is focussed to a spot that is larger than the
width of a bump:


[Figure: laser spot overlapping a bump; spot size 1.2µm]

The size of the spot is such that a roughly equal amount of
light is reflected from the top of the bump as from the
surrounding disc surface. The height of the bump is
approximately a quarter-wavelength of the laser light.
Therefore, light reflected from the surface of the disc will
have travelled a half-wavelength further than the light
reflected from the top of the bump. This will result in
cancellation of the reflected beam in the presence of a bump,
which is detected by a photo-diode.
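

The cancellation can be illustrated numerically. The sketch below
treats the reflection as two equal beams interfering, which is an
idealisation. The laser wavelength of 780nm and the refractive
index of 1.55 for the disc material are assumed typical values,
not figures from this book.

```python
import math

def reflected_intensity(bump_height, wavelength):
    """Relative intensity when equal light returns from bump and surround."""
    path_difference = 2.0 * bump_height          # down to the surface and back
    phase = 2.0 * math.pi * path_difference / wavelength
    return math.cos(phase / 2.0) ** 2            # two equal beams interfering

# Assumed values: 780nm laser, refractive index 1.55 inside the disc.
wavelength_in_disc_nm = 780.0 / 1.55
bump_nm = wavelength_in_disc_nm / 4.0
print(round(bump_nm))                                        # bump height in nm
print(reflected_intensity(bump_nm, wavelength_in_disc_nm))   # with a bump
print(reflected_intensity(0.0, wavelength_in_disc_nm))       # no bump
```

With a quarter-wavelength bump the result is effectively zero;
with no bump the full intensity is returned. It is this change
in reflected light that the photo-diode detects.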


The bumps, and the spaces between them, do not directly
represent the '1's and '0's of the digital audio signal.
Instead they are the result of the channel code used. Each
group of eight bits (half of a sample) is converted into a
particular pattern of bumps and spaces. There are 256 different
patterns, representing the 256 possible combinations of eight
bits.


The recording starts from the centre of the disc and has a
constant linear speed. This means that the rotational speed of
the disc varies from about 500 rpm at the beginning to about 200
rpm at the end. This speed can be controlled by monitoring the
content of the buffer memory used to de-interleave the recording
and remove wow & flutter:


[Figure: buffer MEMORY fed with samples from disc, with wow &
flutter, and emptied as constant rate samples: (a) correct
speed, (b) too fast, (c) too slow]


The player attempts to keep the memory half full. If the memory
starts to fill up the disc is turning too fast and must be
slowed down; if the memory starts to run dry the disc will be
speeded up. The precise linear speed of the disc is not
important: the player will turn it as fast as necessary to
produce 44,100 samples/second. Discs are made with linear
speeds between 1.2 and 1.4 m/s, the slower speed allowing the
full 75 minutes playing time.
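

Both points can be sketched in Python. The first function shows
how the rotational speed follows from the linear speed and the
radius being played; the radii of 25mm and 58mm for the start and
end of the recording are assumed typical values. The second is a
much simplified version of the speed control described above; the
proportional control law, the gain value and the function names
are illustrative assumptions only.

```python
import math

def disc_rpm(linear_speed_m_s, radius_m):
    """Rotational speed needed for a given linear speed at a given radius."""
    return 60.0 * linear_speed_m_s / (2.0 * math.pi * radius_m)

def speed_correction(fill, capacity, gain=0.1):
    """Positive result: speed the disc up. Negative: slow it down."""
    error = fill / capacity - 0.5     # aim to keep the memory half full
    return -gain * error

# Assumed radii: 25mm at the start of the recording, 58mm at the end.
print(round(disc_rpm(1.2, 0.025)), round(disc_rpm(1.2, 0.058)))
print(speed_correction(75, 100))      # memory filling up: slow down
print(speed_correction(25, 100))      # memory running dry: speed up
```

At 1.2 m/s this gives speeds in the region of the 500 rpm and
200 rpm quoted above. Because the correction depends only on
how full the memory is, the player needs no knowledge of the
linear speed at which a particular disc was made.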


CD subcode 


The subcode on a CD is arranged in blocks of eight 96-bit words.
There are 75 blocks per second - this is the origin of the
'frame' count displayed on some players. The eight words are
known as P, Q, R, S, T, U, V & W. At the present time only the
use of the P and Q words has been defined. The main purpose of
these is to enable the player to locate the start of each track
and display a running time.
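

The figures above fix the subcode data rate and the relationship
between a displayed time and a block count. A short Python
sketch (the function names are hypothetical):

```python
BLOCKS_PER_SECOND = 75
WORDS_PER_BLOCK = 8      # P, Q, R, S, T, U, V and W
BITS_PER_WORD = 96

def subcode_bits_per_second():
    """Total subcode data rate for all eight words."""
    return BLOCKS_PER_SECOND * WORDS_PER_BLOCK * BITS_PER_WORD

def to_blocks(minutes, seconds, frames):
    """Convert a minutes:seconds:frames time into a count of blocks."""
    return (minutes * 60 + seconds) * BLOCKS_PER_SECOND + frames

def to_time(blocks):
    """Convert a count of blocks back into (minutes, seconds, frames)."""
    minutes, rest = divmod(blocks, 60 * BLOCKS_PER_SECOND)
    seconds, frames = divmod(rest, BLOCKS_PER_SECOND)
    return minutes, seconds, frames

print(subcode_bits_per_second())
print(to_blocks(1, 0, 0), to_time(4574))
```

The frame count therefore runs from 0 to 74 before the seconds
count increases by one.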


The P word is the pause flag. This is the direct equivalent to
the scroll used on LPs between bands. The P word is set to all
'1's for a period of at least 2 seconds before the start of each
track. The start of the track is defined as the point where the
P word becomes zero again. Just as with the scroll on an LP,
there may be no actual pause in the programme between tracks.
The timing of the pause flag must be set as part of the CD
mastering process.
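

As an illustration, a player could find track starts from the
pause flag as follows. This Python sketch represents the P word
of each block as a single 1 or 0, which is a simplification, and
the function name is hypothetical.

```python
def track_starts(p_flags):
    """Block numbers at which the pause flag falls from 1 back to 0."""
    return [i for i in range(1, len(p_flags))
            if p_flags[i - 1] == 1 and p_flags[i] == 0]

# Flag set before each of two tracks (shortened for clarity; on a
# real disc it is set for at least 2 seconds, i.e. 150 blocks).
print(track_starts([1, 1, 1, 0, 0, 0, 1, 1, 0, 0]))
```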


The Q word carries two time counts: one of absolute time from
the start of disc and another giving time from the start of each
track. It also carries the Index markers - these identify
particular points of interest within a track, Index 1 always
being the start of the track. Also contained within the Q word
is information on pre-emphasis status, 2 or 4 channel use of
disc, and whether or not copying is permitted.


At the very start of a disc, before any programme, the Q word
carries a Table of Contents. This is a list of all the tracks
on the disc with their start times and can be used by the player
to find individual tracks. When a disc is first inserted in a
player the Table of Contents is read and on a disc with many
tracks (the maximum is 99), the player takes a noticeable time
to do this.


The use of the remaining subcode words (R, S, T, U, V & W) is yet
to be defined, but one possible use would be to carry the title
of each track or the words of a song.


RECORDABLE OPTICAL DISK SYSTEMS 


Considerable research and development effort is currently being
directed towards various types of recordable optical disks.
There are two categories of these: the so-called WORM (Write
Once, Read Many) disks that can only be recorded once, and disks
that can be erased and recorded again. A WORM system is
available that records Compact Discs which can be played on
ordinary players.


HARD DISK SYSTEMS


Since a digital audio signal is simply a series of numbers, it
can readily be recorded on the same 'hard' or 'Winchester' disk
systems used by computers. The storage capacity required for a
reasonable recording time is large compared to that provided by
the average desktop personal computer. A typical PC might have
a 20 Mbyte (20 million bytes) hard disk - this would only be
capable of recording about 3 minutes of audio. Currently
available systems use disks of several hundred Mbytes and
provide around two hours of stereo recording. Examples are
Audiofile, Soundstation and the Tablet.
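

The recording time available from a given disk is easily
estimated. The Python sketch below assumes plain 16 bit samples
at 44.1kHz with no overheads; the sampling rate, word length and
number of channels used by a particular system may differ, and
the function name is hypothetical.

```python
def recording_minutes(disk_bytes, sample_rate=44_100, bits=16, channels=2):
    """Minutes of audio that fit on a disk, ignoring any overheads."""
    bytes_per_second = sample_rate * (bits // 8) * channels
    return disk_bytes / bytes_per_second / 60.0

# A 20 Mbyte disk, for one channel and for stereo.
print(round(recording_minutes(20_000_000, channels=1), 1))
print(round(recording_minutes(20_000_000, channels=2), 1))
```

On these assumptions, the figure of about 3 minutes quoted above
corresponds most closely to a single channel of audio.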


Any section of a recording on a hard disk can be accessed
extremely fast. Editing on a hard disk system merely involves
compiling a list of the wanted sections. These are then
replayed from the original recording in the correct order. A
buffer memory is used to cover the gaps while the disk jumps to
the next section. This is comparable to the old technique of
jump editing on 78rpm disk recordings. The source material is
left untouched. It is therefore possible to produce several
different versions of an item, without using up any extra audio
storage capacity.
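

The principle can be sketched in Python. The edit list here is
simply a list of (start, end) sample positions, which is an
assumed representation rather than the format used by any of the
systems named above.

```python
def play_edit_list(source, edit_list):
    """Replay the wanted sections of the source in the listed order."""
    output = []
    for start, end in edit_list:
        output.extend(source[start:end])   # the source is only read
    return output

recording = list(range(10))                # stands in for audio samples
version_a = play_edit_list(recording, [(6, 8), (0, 2)])
version_b = play_edit_list(recording, [(0, 2), (8, 10)])
print(version_a, version_b)
```

Each version costs only the storage for its short list of
positions; the recording itself is unchanged and is shared by
all versions.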


All hard disk systems have one problem in common: the disks
cannot be removed from the system. Programme material must
therefore be copied into the system for editing and then copied
out again to another medium before any new material can be
edited. Current developments in recordable optical disks may
solve this problem. These would be removable and could be used
for the long-term storage of material.


SUMMARY 


Wow & flutter is eliminated by the use of a buffer memory.


Extra data can be recorded along with the audio as a subcode.


Pre-emphasis may be applied before the ADC.


Monitoring through the converter stages will allow any
distortion, such as overload, to be heard even without off-tape
monitoring.


Overloading a digital system results in hard clipping.


Digital meters have peak level at full scale. Overload
indicators are also provided.


Editing requires the use of an advance replay head to overcome
interleaving and enable a short cross-fade at the edit point.


Rotary head systems enable high head-to-tape speeds with low
tape consumption.


Stationary head systems reduce required tape speed by spreading
the digital signal across several tracks. They allow
cut-editing.


Optical disk systems include Compact Disc, WORM disks and
erasable disks.


Hard disk systems record audio as computer data. Editing is
non-destructive. Material must be copied into and out of the
system to another medium for storage.


6
Digital Audio Signal Processing


It is possible to process an audio signal totally in the digital
domain. Fading, mixing and equalisation can be executed by
mathematical operations and carried out by a computer. This
chapter describes basic digital audio processes such as fading
and mixing and builds on the ideas presented to describe more
complex processes such as equalisation, oversampling and sample
rate conversion.


FADING 


A conventional fader on a sound desk changes the amplitude of an
audio signal. If the amplitude of a digital signal is reduced,
the system will operate normally with no noticeable change until
a point is reached when it is no longer possible to distinguish
between a binary one or zero, after which the digital system
will cease to function.


The amplitude of an audio signal is measured when converting
from the analogue to the digital domain and the digital signal
is a code representing the amplitude of the audio signal at
regular intervals of time. Therefore a digital fader needs to
change the magnitude of a code and each sample needs to have its
magnitude changed by the same amount. This is carried out by
multiplication.


[Figure: Input Samples multiplied to give Output Samples]


To fade a signal up and down you need to change the number you
are multiplying by. An operational digital fader needs a
control surface to provide the changing multiplier number that
represents the required programme control. This control surface
can consist of rows of buttons to fade up or down, rotary
controls, flat sliders or quadrant style faders. It might also
be possible to control the gain of the digital fader from a
memory system, so providing automated assistance to the mixing
process.


If the multiplying number has a value greater than one, the
fader will have gain and the amplified signal may exceed the
maximum permitted value within the digital system. This will be
audible as distortion. This is directly equivalent to the
problem of having insufficient headroom in an analogue sound
desk.


If the multiplying number is less than one, the fader will
reduce the level of the signal. There may be a remainder
quantity at the end of the calculation, which, if ignored, is an
error and may be audible as increased quantising noise. To
overcome the problems of errors, processing is usually carried
out to a greater accuracy than that of the analogue to digital
encoding. A 16 bit encoded signal may be processed by an 18
bit, 24 bit or even larger computing system.
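

A minimal Python sketch of a digital fader, showing both the
clipping that results from too much gain and the benefit of
working to a greater accuracy. The function name and the choice
of 8 extra bits are illustrative assumptions, and the rounding
used is simply that of Python's built-in round().

```python
def fade(samples, gain, lo=-32768, hi=32767):
    """Multiply each sample by the gain, round, and clip to the range."""
    return [max(lo, min(hi, round(s * gain))) for s in samples]

# Gain greater than one: the result is clipped at full scale.
print(fade([30000], 2.0))

# Fade down then up again, rounding to 16 bits at each stage.
narrow = fade(fade([101], 0.5), 2.0)

# The same, but carrying 8 extra bits through the processing.
scale = 1 << 8
wide = fade(fade([101 * scale], 0.5, -32768 * scale, 32767 * scale),
            2.0, -32768 * scale, 32767 * scale)
print(narrow, [w // scale for w in wide])
```

In the 16 bit case the remainder discarded after the first
multiplication cannot be recovered, so the sample comes back as
100 rather than 101. With the extra bits the original value
survives.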


MIXING 


In a conventional sound desk several signals are added together
in the mixing process. In a digital system, with the amplitude
of audio signals represented by a binary code, mixing is carried
out by the mathematical operation of addition.


[Figure: Samples from each source added together (Mixer)]


Again, as with fading, the processing needs to be carried out to
a greater accuracy than the original analogue to digital
encoding to reduce cumulative errors.
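

Mixing by addition can be sketched in Python as follows. Python
integers do not overflow, so the running total here stands in for
the wider accumulator of a real processor; the 16 bit output
limits and the function name are assumptions for illustration.

```python
def mix(channels, lo=-32768, hi=32767):
    """Add corresponding samples from each channel, clipping the result."""
    output = []
    for frame in zip(*channels):
        total = sum(frame)                 # held to full accuracy
        output.append(max(lo, min(hi, total)))
    return output

print(mix([[100, 200], [50, -300]]))
print(mix([[30000], [30000]]))             # sum exceeds full scale
```

In practice each source would first pass through its own fader,
so that a mixer is a set of multiplications followed by an
addition.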


EQUALISATION 


An equaliser changes the gain of a system depending on the
frequency of the signal passing through it. Unfortunately
there is no single simple mathematical process to represent
equalisation.


A digital audio signal is a code which represents a string of
discrete audio samples taken at regular points in time.
Individual samples contain no information about the frequency of
an audio signal.


If two samples are compared it is possible to make calculations
about the rate of change of the amplitude of an audio signal.
It is possible, though, to draw many different wave shapes,
representing signals of different frequencies, through only two
samples.


The argument can be developed to say that three samples will
give a better idea about the rate of change of a signal and that
four samples will give an even better picture of the situation.
The point is that many samples need to be compared to gain
accurate information about the audio frequency of a signal.


Filtering is often carried out by a device containing a chain of
delay elements which store a number of successive audio samples.
Individual audio samples are moved along the chain and their
levels are changed by a multiplier at the output of each delay
element. The value of each multiplier coefficient gives a
weighting factor to the relative importance of each sample in
determining the frequency of an audio signal. The outputs of
each of the multipliers are added together and a new set of
digital samples is produced that represents the filtered audio
signal. The values of the multiplier coefficients determine the
frequency response of the filter.
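

The delay chain described above can be sketched in a few lines of
Python. This is only an illustration: the function name and the four
equal coefficients are assumptions chosen for the example, not values
taken from any real equipment.

```python
def fir_filter(samples, coefficients):
    """Filter samples with a chain of single-sample delays, multipliers and an adder."""
    delay_line = [0.0] * len(coefficients)  # one storage element per multiplier
    output = []
    for sample in samples:
        # Move every stored sample one place along the chain.
        delay_line = [sample] + delay_line[:-1]
        # Weight each stored sample by its coefficient and add the results.
        output.append(sum(c * s for c, s in zip(coefficients, delay_line)))
    return output


# Four equal coefficients give a moving average (a crude low-pass filter).
smoothed = fir_filter([0.0, 4.0, 4.0, 4.0, 4.0], [0.25, 0.25, 0.25, 0.25])
```

The sudden step in the input is spread over four output samples,
which is the smoothing expected of a low-pass response. Changing the
coefficients changes the response.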


In pure mathematical theory the chain of delay elements should
stretch to infinity. This is a little impractical in reality, so
the number of delay elements is chosen so that the filter only
produces an error of less than one quantising level.


A real filter will consist of between 50 and 250 delay elements
and multipliers.


The advantage of this design is that despite appearing complex
it contains many identical elements which can be incorporated
into integrated circuits and produced cheaply. These filters
are mainly found in the input and output stages of equipment
where fixed filtering is required. It is difficult to produce a
filter that has variable characteristics, as such a large number
of multiplier coefficients would need to be changed to produce
different responses.


VARIABLE FILTERS 


Operational filters with variable characteristics are usually
based on an apparently less complex design.


In a basic design of one of these variable filters the chain of
delay elements from the previous design is replaced by a single
delay element which has its output fed back through a multiplier
to its input. Many samples are taken into account because the
result of each multiplication is passed around a loop.


With this type of filter design it is only necessary to change a
single multiplier coefficient to vary the response of the
filter.
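

A minimal sketch of such a feedback arrangement is given below. It
assumes the simplest possible layout (one adder, one multiplier and
one single-sample delay); the function name and the coefficient of
0.5 are illustrative only.

```python
def feedback_filter(samples, coefficient):
    """One delay element whose output is fed back through a multiplier."""
    delayed = 0.0  # contents of the single delay element
    output = []
    for sample in samples:
        # Adder: new input plus the previous output scaled by the multiplier.
        delayed = sample + coefficient * delayed
        output.append(delayed)
    return output


# A single non-zero sample keeps circulating round the loop, getting smaller.
response = feedback_filter([1.0, 0.0, 0.0, 0.0], 0.5)
```

One input sample influences every later output, which is how a single
delay element takes many samples into account. Altering the one
coefficient alters how quickly the circulating value dies away, and
hence the response.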


It is possible to come up with many different designs of
variable filters giving band pass or shelf responses. All such
designs use the basic building blocks of addition,
multiplication and single sample delay.


OVERSAMPLING 


The reconstruction (anti-image) filter in a digital to analogue
converter can be a complex piece of analogue electronics if
image frequencies are to be significantly attenuated without
greatly affecting the required programme. A digital system
sampling at 48kHz with an audio frequency response to 20kHz
needs a filter that can provide a large amount of attenuation in
less than half an octave. This is a very tall order for any
design.


One solution to the problem of designing complex analogue
filters is to carry out some of the filtering in the digital
domain using a process known as oversampling.


When an audio signal is converted into digits the audio is
sampled and held until the next sample is taken. The result of
this process is that the audio signal is given a 'stepped'
appearance if viewed graphically.


This stepped waveform can be thought of as a high frequency
signal being added to the audio signal which can be removed
totally by filtering. It is important to understand that
sampling does not reduce the quality of the audio signal. The
audio can always be recovered by filtering if the sampling is
carried out at a fast enough rate.


Oversampling is the process where extra samples are calculated
and inserted between existing audio samples. This calculation
process is carried out by a digital filter and therefore the
newly created samples are real audio samples, as if the original
signal had been sampled at a higher rate, and not merely
interpolations.


Increasing the sampling rate causes an increase in the image
frequencies and so the analogue filter required on the output
of a converter can have a turnover frequency outside the audio
range with a relatively low slope.
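

The room available to the analogue filter can be estimated with a
short calculation. The sketch below assumes that the lowest image
frequency lies at the sampling frequency minus the top audio
frequency (20kHz here); the function name is illustrative.

```python
import math


def transition_octaves(sample_rate_hz, audio_top_hz=20_000):
    """Octaves between the top of the audio band and the lowest image frequency."""
    lowest_image_hz = sample_rate_hz - audio_top_hz
    return math.log2(lowest_image_hz / audio_top_hz)


plain = transition_octaves(48_000)            # conventional 48kHz conversion
oversampled = transition_octaves(4 * 48_000)  # four times oversampling
```

At 48kHz the filter has a little under half an octave in which to
work. With four times oversampling it has just over three octaves, so
a much gentler slope will do.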


[Figure: at 4 x sampling frequency the analogue filter slope can
extend over three octaves.]


The oversampling filter consists of a long chain of delay
elements which hold each sample for, say, a quarter of a sample
period in the case of four times oversampling. The values of
the multiplier coefficients are calculated to produce the
intermediate samples.


There are further benefits from using oversampling.


1) The noise produced by the digital to analogue conversion process
is spread out over a far wider spectrum than if a conventional
DAC was used. Much of the noise is distributed beyond the
normal hearing range and after analogue filtering an improvement
of about 6dB in the noise figures can be gained when using four
times oversampling.


2) The costs involved are mainly in the design stage of an
integrated circuit. Digital oversampling filters are cheaper
than their analogue equivalents.
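

The figure of about 6dB in point 1 can be checked with a rough
calculation. The sketch assumes that the total noise power stays the
same and is spread evenly over the wider band, so that only a
fraction of it remains inside the audio band. Real converters will
not follow this exactly.

```python
import math


def in_band_noise_gain_db(oversampling_factor):
    """Noise improvement when evenly spread noise covers a wider band."""
    # Only 1/factor of the noise power stays inside the original band.
    return 10 * math.log10(oversampling_factor)


gain_4x = in_band_noise_gain_db(4)
```

Four times oversampling leaves a quarter of the noise power in band,
which is an improvement of about 6dB.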


Filters are now used in some compact disc players, taking the
process of oversampling to what can seem to be an extreme.
These filters oversample the digital signal by some 256 times.
It may appear that this is just the Hi-Fi market chasing numbers
and specifications, but these oversampling devices do have
advantages over more conventional filtering processes.


It is difficult to build low priced good quality 16 bit DACs
because it is difficult to measure voltage to the accuracy
required. It is, though, possible to measure time to a very high
degree of accuracy. The reason for the high degree of
oversampling is to produce a digital output that consists of a
stream of one bit samples. This is at a high bit rate of around
11M bits/sec. The output digital signal is a '1' for a rising
audio signal and a '0' for a falling one. The conversion from a
digital signal can be seen on the two diagrams below. The
higher the density of digits in one direction, the higher the
audio voltage produced.


SAMPLE RATE CONVERSION


There are three sampling rates which will be found in a digital
studio:


32kHz     Standard for distribution, OBs etc


44.1kHz   Standard for commercial recordings, CDs etc


48kHz     Standard for professional recordings and desks.


An all-digital sound studio has to be able to accept source
material at all three standard sampling rates and probably also
at some odd non-standard rate, so it will be necessary to
convert between the sampling rates.


The conversion between 32kHz and 48kHz sampling is relatively
straightforward. There is a common multiple of 96kHz and
oversampling techniques can be used for sample rate conversion.
If a 48kHz system is oversampled twice the sampling rate becomes
96kHz and if you then take every third sample you arrive at
32kHz. The conversion from 32kHz to 48kHz sampling is simply
the reverse of this process.


Problems with conversion arise when there is no simple
mathematical relationship between different sampling rates, in
particular when converting to or from a sampling rate of 44.1kHz
(as is used for CD recordings). The lowest common multiple
of 48kHz and 44.1kHz is around 7MHz (7.056MHz).
Oversampling at these frequencies whilst not impossible is very
difficult and expensive, so other more involved techniques have
been developed. These techniques involve oversampling to
produce a large number of samples and then using variable
filters to produce samples at the required rate. This process
can also be used to convert between a variable sample rate (e.g.
from a vari-speed tape machine), and a fixed sampling rate (e.g.
a desk input). It can also be used to synchronise a digital
source which is remote from a studio.
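

The common multiples quoted above can be worked out directly. The
following sketch finds the lowest common multiple of two sampling
rates, the oversampling factor needed to reach it and the spacing of
the samples that are then kept. The function name is illustrative.

```python
import math


def conversion_factors(source_hz, target_hz):
    """Return (common rate, oversampling factor, keep every Nth sample)."""
    common_hz = source_hz * target_hz // math.gcd(source_hz, target_hz)
    return common_hz, common_hz // source_hz, common_hz // target_hz


easy = conversion_factors(48_000, 32_000)  # oversample twice, keep every third
hard = conversion_factors(48_000, 44_100)  # no simple relationship
```

For 48kHz to 32kHz the common rate is 96kHz. For 48kHz to 44.1kHz it
is 7.056MHz, which means oversampling 147 times and keeping every
160th sample, and this is why more involved techniques are preferred.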


Audio signal processing can be carried out in a digital form by 
using three basic digital building blocks:- 


1) a multiplier


2) an adder


3) a single sample delay


Fading is carried out by a MULTIPLIER. 


Mixing is carried out by an ADDER. 


Equalisation is carried out by using MULTIPLIERS, ADDERS 
and DELAYS. 


OVERSAMPLING is a form of filtering used in the digital domain 
to simplify the design of analogue reconstruction filters. 


Oversampling techniques are also used to convert between digital 
audio systems with different sampling rates. 


7 
Digital Systems in Sound Studios


There are several different types of digital inputs and
outputs that can appear on different manufacturers' equipment.
This chapter outlines some problems which may occur when
connecting together a number of pieces of digital audio
equipment. The AES/EBU digital audio interface is described
in detail, along with the domestic SPDIF and the
multi-channel system MADI.


THE PROBLEM 


When connecting together two pieces of analogue audio
equipment there are several problems that can occur:


(i) Input and output connectors may be of different types.


(ii) Input and output signal levels may be different.


(iii) The devices may have balanced or unbalanced inputs and
outputs.


At the end of the day it is nearly always possible, with a
little ingenuity and the help of a few specially made up
leads, to connect together any amount of equipment to build a
large audio system.


Digital systems have all of these problems and more.
Differences in signal parameters such as sample rate, number of
bits per sample and error protection can prevent one piece of
equipment understanding digital signals from another.


Each manufacturer of digital audio equipment has developed
its own digital interface standards. These interfaces not
only communicate digital audio but also convey information
about signal parameters such as pre-emphasis and possibly CD
sub-codes. All this is encoded along with the digital audio.


As no one standard has been agreed between the manufacturers,
many different digital techniques are used, along with a wide
range of cables and plugs. So it is very difficult to pick and
choose the best pieces of equipment from different companies
when building a digital studio.


It is possible to buy 'bodge boxes' that will convert between
different interface systems. These boxes have to translate
from one digital system to another, so they can be complex
pieces of electronic equipment and therefore expensive.


SYNCHRONISATION


If you connect together two pieces of digital audio
equipment, such as an RDAT player and a digital sound desk,
you may still experience a few technical problems, even with
a common interface standard. In a short period of time, the
tape machine may play one million bits of digital audio into
the desk, but the desk processing may need one million and
one bits. This is a small difference but that single bit is
large enough to prevent the correct operation of the sound
desk. This problem is often audible as regular clicks or
ticking as the desk finds it impossible to read a section of
data and disregards an audio sample. It is therefore
necessary to make sure that all digital audio equipment
within a studio is synchronised to operate at exactly the
same bit rate.
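

The regular nature of the clicks follows from simple arithmetic. The
sketch below assumes two free-running clocks that differ by a fixed
number of parts per million, and that one audio sample is lost or
repeated each time they drift apart by a whole sample. The function
name and figures are illustrative.

```python
def seconds_between_slips(sample_rate_hz, clock_error_ppm):
    """Time for two unsynchronised clocks to drift apart by one whole sample."""
    samples_gained_per_second = sample_rate_hz * clock_error_ppm / 1_000_000
    return 1.0 / samples_gained_per_second


# One part in a million, as in the example above, at 48kHz sampling.
interval = seconds_between_slips(48_000, 1.0)
```

Even this tiny error gives a slip roughly every 21 seconds. A larger
error gives proportionally more frequent clicks.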


The simplest way of achieving synchronisation is to feed a
'clock' signal, that consists of a chain of ON/OFF pulses,
from one piece of equipment to all the others. This is the
mechanism used in the Sony Digital Interface (SDIF) and
Mitsubishi (Melco) systems. Both interfaces communicate 20
bits of digital audio along with control bits in a serial
form. In a stereo system three cables are required, one for
each of the audio channels and one for a clock signal (a
square wave at the sampling frequency).


The AES/EBU interface encodes the clock signal within the
digital audio signal. This has the effect of increasing the
bit rate and has the advantage of greatly simplifying the
signal routing in large studios. You only need to provide a
feed of the digital audio signal from one machine to another
to achieve synchronisation. In a studio where several tape
machines feed into a desk you must take a feed of a sync.
signal to an input on all machines to achieve total
synchronisation. This is simple as the record inputs to most
tape machines also act as synchronisation inputs and the desk
output signal will act as the sync. signal. It is common to
find digital inputs on replay only devices, where these inputs
are for synchronisation of the output signal.


[Figure: Interconnections required within a small digital studio.]


In large studios and studio centres signals need to be
routed between different areas. So it is desirable to have
a single master clock signal synchronising all digital audio
equipment. Two standards of clock references have been
proposed:-


Grade 1 for use in large studio complexes and
operating to a very high accuracy (expensive).


Grade 2 for use in single studios, a little less
accurate (a little less expensive).


THE AES/EBU DIGITAL AUDIO INTERFACE


The AES/EBU digital audio interface was developed in 1985 to
overcome the problem of incompatibility between different
manufacturers' equipment and has become established as an
industry standard. It is a serial digital audio interface
system and operates at sampling rates of either 32kHz,
44.1kHz or 48kHz.


The basic parameters of the interface are:-


*** It is a balanced system and uses existing audio cables.


*** The digital signal level is between 3 & 10 volts.


*** The standard connector is the 3 pin XLR.
    Male XLR for output, Female XLR for input.


*** All connectors are labelled Digital Audio Input or Output.
    The abbreviations DI and DO are also used.


*** No more than four pieces of equipment can be
    connected across a single output.


The AES/EBU digital audio interface is a stereo system. A
frame of digital information consists of two sub-frames, one
for the left channel and one for the right. Each sub-frame
consists of 32 bits and is used to communicate one sample of
digital audio.


The structure of a sub-frame is shown below:-


[Figure: 1 Subframe (32 Bits)]


The Preamble bits indicate the beginning of each sub-frame of
information. There are three different types of preamble and
they indicate whether the signal following is:-


1) Left channel of a stereo pair


2) Right channel of a stereo pair


3) Left channel and the first of a sequence of 192
samples. This is needed to help decode the Channel
Status codes.


[Figure: sub-frame layout, with AUDIO DATA running from LSB to MSB
followed by the V, U, C and P bits.]

There are 24 bits available for Audio Sample Data. This may
seem a little excessive when most digital systems only encode
analogue signals to 16 bits but with improving technology 24
bits will be required. The full audio dynamic range from the
threshold of hearing to the threshold of pain is about 120dB.


(24 bit gives approx 144dB dynamic range)
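

The figure of approximately 144dB follows from the ratio between the
largest value a word can hold and a single quantising step. The
sketch below uses that simple ratio only and ignores refinements such
as dither; the function name is illustrative.

```python
import math


def dynamic_range_db(bits):
    """Ratio of full scale to one quantising step, expressed in dB."""
    return 20 * math.log10(2 ** bits)


range_16 = dynamic_range_db(16)
range_24 = dynamic_range_db(24)
```

Each extra bit adds about 6dB, so 16 bits give roughly 96dB and 24
bits roughly 144dB, comfortably more than the 120dB range of
hearing.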


Key to the sub-frame diagram:

Preamble or sync group
Auxiliary audio or other data
Digital audio sample data
V  Audio sample validity
U  User bit data
C  Audio channel status
P  Subframe parity


There is an option to only use 20 bits for digital audio and
re-use the four least significant bits for other purposes.
There is a proposal to use these four bits as a low grade
talkback channel.


There are four status bits within each sub-frame as follows:-


1) The PARITY BIT is provided to check for system
reliability. It is not intended that the interface will be
able to cope with large numbers of random errors as may occur
on a long radio link. Once two pieces of digital audio
equipment are operating together they should continue to
operate unless there is a system fault.


2) The VALIDITY BIT is set when a fault is detected. The
audio sample can then be disregarded and possibly a fault
alarm raised.


3) CHANNEL STATUS BITS are stored from successive digital
frames until 192 have been collected. These status bits
contain the following information about the audio signal and
the use of the interface:-


Consumer / Professional use
Audio / Non-audio (data) use
Emphasis on / off, and type of emphasis used.
Sampling frequency (48kHz / 44.1kHz / 32kHz)
Mode - Stereo, two channel or single channel (mono)
Number of audio bits used (20 or 24 bits)
Alphanumeric data indicating source, destination and
time of day.
Error checking codes for the channel status bits


It should be noted that not all equipment with AES/EBU
connectors makes use of all the information that is contained
within the channel status bits. You may still have to select
parameters on a record machine such as sampling rate and type
of emphasis used. At present the alphanumeric data is
ignored by many pieces of professional digital audio
equipment.


4) USER DATA bits can be used in any way the designer of a
piece of equipment wishes. The only advice given by the
AES/EBU is that there are design advantages in using a
similar bit structure to that of the channel status bits. A
possible use of the USER DATA bits is to communicate the
sub-code data as recorded on compact discs.
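

As an illustration of how the four status bits sit alongside the
audio data, the sketch below assembles the 28 bits that follow the
preamble. The bit order (audio LSB first, then V, U, C and P) and the
use of even parity are assumptions made for this example, and the
function names are hypothetical.

```python
def build_subframe_bits(sample, validity=0, user=0, channel_status=0):
    """Return the 28 bits after the preamble: audio (LSB first), V, U, C, P."""
    if not 0 <= sample < 2 ** 24:
        raise ValueError("sample must fit in 24 bits")
    bits = [(sample >> i) & 1 for i in range(24)]  # audio data, LSB first
    bits += [validity, user, channel_status]
    bits.append(sum(bits) % 2)  # parity bit makes the number of ones even
    return bits


def parity_ok(bits):
    """A received sub-frame passes if it holds an even number of ones."""
    return sum(bits) % 2 == 0


frame = build_subframe_bits(0x000003, channel_status=1)
corrupted = frame[:]
corrupted[5] ^= 1  # a single bit damaged in transit
```

A single damaged bit upsets the count of ones and is detected. Two
damaged bits would cancel out, which is why parity serves as a check
on reliability and not as protection against large numbers of errors.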


The strength of the AES/EBU digital audio interface is that
it is a very flexible system, but it also has practical
drawbacks. Including AES/EBU inputs and outputs on equipment
can be expensive and they are therefore omitted by many
manufacturers of medium priced semi-professional equipment.


THE SONY/PHILIPS DIGITAL INTERFACE (SPDIF)


SPDIF is the domestic version of the AES/EBU digital audio
interface. The system is unbalanced and uses Phono
connectors. The electrical signal is lower in amplitude than
that used by the AES/EBU interface, being about 0.5 volts
(between 3 & 10 volts for AES/EBU). The digital frame
structure is identical to the professional interface but
there are differences in the use of the channel status bits
which can cause difficulties when connecting domestic to
professional equipment.


The different uses of the first few channel status bits have
the greatest consequences for compatibility between the two
systems. For example:-


Bit 0 (the first bit) indicates the professional or domestic
use of the Channel Status bits.


         Professional use          Domestic use

Bit 1    Audio / Non-audio data    Audio / Non-audio
Bit 2    Audio emphasis            Copy prohibit
Bit 3    Audio emphasis            Audio emphasis on/off


If a digital signal from a professional recorder is plugged
into the input of a domestic RDAT, the recorder will register
Bit 0 of the Channel Status as indicating the professional
format. The 'COPY PROHIBIT' light will come on if an attempt
is made to record from the digital input as it is assumed
that all professional material should not be copied. It will
still be possible to record from the analogue inputs.


The same effect will be seen if an attempt is made to record
the digital output of a domestic CD player on a domestic RDAT
machine. Bit 2 of the Channel Status prevents the digital
copying of commercially published recordings.


If a digital signal is connected from domestic to
professional equipment Bit 0 may not prevent copying. The
copy prohibit bit, though, may be interpreted as audio
emphasis codes.


MADI
Multi-channel Audio Digital Interface


MADI is an extension of the AES/EBU digital audio interface
which is capable of handling up to 56 channels. It is an
unbalanced system and uses standard video cables of up to 50
metres fitted with BNC connectors.


MADI is intended for communicating between two pieces of
equipment only and not for distributing signals (e.g. to
connect a digital mixing desk to a digital multitrack
recorder). A single MADI link would carry signals from the
desk to the tape machine and another link would return
signals for replay.


Audio data is handled in a similar format to the AES/EBU
interface except the preamble is replaced by four MADI
control bits.


[Figure: 1 Subframe (32 Bits), showing the MADI channel active bit,
digital audio, audio channel status and subframe parity.]


Bit 0 indicates the first channel (56 possible).


Bit 1 is an indication from the sending equipment that the
following channel is in use and contains valid audio data.


Bit 2 takes the place of the AES/EBU preamble that indicates
that the audio data is the A or B channel of a stereo pair.


Bit 3 indicates the start of the first bit of the 192 Channel
status bits in the AES/EBU system.


The data rate of a MADI system is fixed at 125M bits/sec
irrespective of the number of channels in use or the sampling
rate. Additional bits are inserted to keep the bit rate
constant when the full capacity is not being used.
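

A rough idea of how much of the link is occupied by audio can be
had from the figures given. The sketch below simply multiplies
channels, sampling rate and the 32 bits of each sub-frame, and
compares the result with the fixed link rate. It deliberately ignores
any overhead added by the way the bits are coded on the cable, so the
spare capacity shown is an upper estimate only.

```python
LINK_RATE_BITS_PER_SECOND = 125_000_000  # fixed rate quoted in the text


def audio_bits_per_second(channels, sample_rate_hz, bits_per_subframe=32):
    """Bits per second needed to carry one sub-frame per channel per sample."""
    return channels * sample_rate_hz * bits_per_subframe


full_load = audio_bits_per_second(56, 48_000)
spare = LINK_RATE_BITS_PER_SECOND - full_load
half_load = audio_bits_per_second(28, 48_000)
```

Even with all 56 channels in use at 48kHz the audio needs about
86M bits/sec, so there is always some capacity to be filled, and more
when fewer channels are active.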


It is not possible to connect together different pieces of
digital equipment unless a common interface standard is used.
Signal parameters such as sample rate, number of bits per
sample and parity all have to be the same for two pieces of
digital equipment to correctly pass information between each
other.


It is also necessary for all pieces of digital equipment
within a studio to be synchronised to exactly the same
digital 'clock' rate. If digital equipment is not
synchronised errors may be audible as clicks or ticking.


The AES/EBU digital audio interface is becoming established
as a sound industry standard. Balanced audio cables are used
with XLR connectors to connect equipment together. One cable
can be used to carry a stereo signal, two channels or a mono
signal. Up to 24 bits per sample of digital audio is carried
on the interface along with status information about
sample rate, signal emphasis etc.


SPDIF is the domestic version of the AES/EBU interface. The
two systems differ mainly in the use of the status
information which will prevent domestic equipment copying
professional recordings.


MADI is a proposed multi-channel version of the AES/EBU
interface that is capable of carrying up to 56 channels of
digital audio.


8 
Digital Broadcasting Systems


Digital audio equipment has been used in broadcasting for many
years, long before devices such as Compact Disc players and RDAT
recorders were developed for the consumer market. In the
seventies the BBC developed techniques for the distribution of
Radio and Television sound to the transmitter networks. This
chapter describes some of the ideas behind these systems and how
they have developed over the years.


EARLY SYSTEM IN TELEVISION


A digital system for sound distribution in television has been
in use since 1968 and was developed for reasons of sound quality
and economy. Using digital techniques, it is possible to
distribute sound multiplexed within the video signal and so save
on the cost of separate sound circuits.


A television picture is formed by a beam scanning horizontally
across a television tube. Each time the beam reaches the right
hand side of the television screen it is quickly returned to the
left of the screen, only a little lower down so that the screen
is filled vertically by repeated horizontal scanning. The beam
is returned to the left of the screen in a domestic television
set by a part of the video signal known as a line synchronising
or 'line sync' pulse.


[Figure: television line waveform against time, showing picture,
black and sync levels.]


It is only the leading edges of the sync pulses that are
required to initiate the return of the television scanning beam
to the left of the screen. The time occupied by the bottom of
the sync pulse can be re-used for other purposes such as the
distribution of a digital sound channel. The sync pulses need
to be cleaned up before the vision signal is transmitted so
that the relatively simply designed domestic television set
will function correctly.


This digital audio system is known as Sound in Syncs or SIS.
The obvious advantage of SIS is that it saves hiring a network
of analogue audio lines to all transmitter sites. The sound is
also "married" to the vision, so reducing the chances of the
sound being transmitted with the wrong pictures.


Sound in Syncs samples the audio twice during each vision line
giving a maximum audio frequency of 15kHz (Line rate =
15.625kHz, Sampling rate = 31.25kHz). It is therefore necessary
to send two audio samples in each synchronising pulse of the
video waveform but there is only enough room in the available
bandwidth to send twenty bits. Each audio sample can therefore
only be quantised to ten bit accuracy.
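

The arithmetic behind these figures is set out below as a short
worked example. The constant names are illustrative. The only inputs
are the line rate, two samples per line and twenty bits per sync
pulse, all taken from the description above.

```python
LINE_RATE_HZ = 15_625        # television line rate
SAMPLES_PER_LINE = 2         # audio is sampled twice per vision line
BITS_PER_SYNC_PULSE = 20     # room available in each sync pulse

sampling_rate_hz = LINE_RATE_HZ * SAMPLES_PER_LINE         # twice the line rate
bits_per_sample = BITS_PER_SYNC_PULSE // SAMPLES_PER_LINE  # shared between two samples
highest_audio_hz = sampling_rate_hz / 2                    # half the sampling rate
sound_bit_rate = LINE_RATE_HZ * BITS_PER_SYNC_PULSE        # bits carried per second
```

Sampling at twice the line rate gives 31.25kHz, which allows audio
up to a theoretical 15.625kHz (15kHz in practice), with ten bits for
each sample.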


With a ten bit system the noise would be far too audible so an
analogue noise reduction system is used to improve the audio
quality. The dynamic range of the audio signal is reduced by a
compressor before A to D conversion and this has the effect of
lifting quiet sounds above the quantising noise. The dynamic
range of the audio signal is restored after D to A conversion by
the use of an expander at the transmitter sites.


Sound in Syncs as developed by the BBC has been very successful
and has been used by many broadcasting organisations. Several
generations of equipment have been produced and the system is
only now being replaced because of the requirement for stereo
television sound.


EARLY SYSTEMS IN RADIO


A digital distribution network for Radio has been in service
since 1972 and was introduced because it was impossible to
obtain the required number of stereo lines to all the major
transmitter sites. Equipment was developed to digitally combine
thirteen audio channels. This was the only such system in
existence at the time and it was known as the PCM System (Pulse
Code Modulation) or just PCM. This equipment is being slowly
phased out and should be totally replaced by the early 1990s.


Each audio input channel to the PCM system is sampled at 32kHz,
giving a maximum audio frequency of 15kHz. Each audio sample is
quantised to 13 bit accuracy and a single parity bit is added
for error detection. Thirteen bit digital audio is considered
to be deficient by modern day standards but at the time of its
design the level of quantising noise was far below that of a
conventional long distance analogue circuit.
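
As an illustration of how a single parity bit detects an error,
here is a minimal Python sketch. It assumes even parity and
places the parity bit at the least significant end of the word;
the text does not say which convention the PCM system used, so
both choices are assumptions made for the example.

```python
def add_even_parity(sample, data_bits=13):
    """Append one parity bit so that the whole word holds an even number of 1s."""
    if not 0 <= sample < (1 << data_bits):
        raise ValueError("sample does not fit in the given number of bits")
    parity = bin(sample).count("1") % 2
    return (sample << 1) | parity


def parity_ok(word):
    """True when the received word still holds an even number of 1s."""
    return bin(word).count("1") % 2 == 0


word = add_even_parity(0b1011001110001)   # a 13-bit sample
corrupted = word ^ (1 << 5)               # one bit flipped in transit
print(parity_ok(word), parity_ok(corrupted))
```

A single parity bit can only reveal an odd number of bit errors:
flipping two bits of the same word leaves the parity unchanged,
so the damage goes unnoticed.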


The PCM system is being replaced because the design is not
flexible enough to allow for the expansion required by
developments within broadcasting, such as the introduction of
stereo for Radio 1. With developments in digital techniques it
is also possible to multiplex more audio channels into the same
available bandwidth.


PCM and the television Sound in Syncs are being replaced by a
digital audio system known as NICAM which is an acronym for Near
Instantaneous Companded Audio Multiplex.


There are two facets to NICAM:

1) Digital companding which reduces the number of bits
   required by each audio sample.

2) A multiplexing structure which enables many different
   sized digital audio networks to be constructed.


NICAM - A COMPANDING SYSTEM 


A high quality digital audio system needs to sample the audio at
a rate of at least twice the highest audio frequency and to
quantise the signal accurately to obtain a low level of noise.
NICAM is a 14 bit digital audio system which uses a sampling
rate of 32kHz (to give a maximum audio frequency of 15kHz). It
would be easy to design a system with better performance but the
penalty for this would be a higher bandwidth. A compromise has
to be reached between the cost of a higher bandwidth and audio
quality.


NICAM makes use of several techniques to reduce the number of
bits needed to be sent for each audio sample so that more
information in the form of digital audio channels can be sent
down a line than would otherwise be possible for a given
bandwidth.


Above is a graphical illustration of an audio signal against a
background of the quantising levels produced during the
conversion process to a digital signal.


When the amplitude of the signal is high, a large number of
levels are crossed by the signal and all the available bits are
used. But when the amplitude of the signal is low only the
least significant bits are used along with the most significant
bit which indicates the polarity of the audio signal.


When the audio signal is high in level all of the digits may be
used and the least significant bits represent low level audio
components. In the presence of high levels, 'Noise Masking'
takes place and any low level components are inaudible. It is
therefore not necessary to send these least significant bits
when the audio level is high.


When the signal is low level some of the digits do not change
state and are only needed to indicate the fact that the signal
is low in level. If it were possible to indicate this in some
other way to the receiving equipment, it would not be necessary
to send these unused bits.


NICAM makes use of both of these ideas and reduces 14-bit
digital audio down to a 10-bit system. When the audio signal is
high in level only the ten most significant bits are sent and
the four least significant bits are dropped. This will increase
the level of quantising noise but as the audio signal is high
this low level noise will be masked.


NICAM is a direct equivalent to an analogue companding noise
reduction system except that the signal processing takes place
in the digital domain.


To ensure satisfactory decoding the level of the audio signal
must be communicated to the receiving NICAM decoder. This is
done by sending a separate code known as a range code and there
are 5 level ranges as is shown in the diagram below.

A three bit code is required to communicate which of the five
ranges the audio sample is in. If this range code were added to
the audio sample a 14-bit system would have been reduced to a 10
+ 3 bit system (or a 13 bit system). This would be expensive on
bandwidth so a range code is only sent for every 32 samples
(every 1 millisecond). The justification for this is that
although audio signals are complex and unpredictable, real
programme material does not tend to change in level
significantly in such a short interval. The range code sent by
the NICAM coder is that determined by the highest signal level
in the 32 sample period.
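
The saving from sharing one range code between 32 samples can be
checked with a short Python calculation. This is only a sketch
of the arithmetic: it ignores the parity and synchronisation
bits and any protection given to the range code, and the names
are chosen for the example.

```python
SAMPLING_RATE_HZ = 32_000
BLOCK_SAMPLES = 32
SAMPLE_BITS = 10
RANGE_CODE_BITS = 3


def block_duration_ms(block_samples=BLOCK_SAMPLES, rate_hz=SAMPLING_RATE_HZ):
    """Time taken to collect one block of samples, in milliseconds."""
    return 1000 * block_samples / rate_hz


def average_bits_per_sample(block_samples):
    """Sample bits plus each sample's share of one range code per block."""
    return SAMPLE_BITS + RANGE_CODE_BITS / block_samples


print(block_duration_ms())             # duration of a 32 sample block
print(average_bits_per_sample(1))      # range code sent with every sample
print(average_bits_per_sample(32))     # range code shared by 32 samples
```

With a range code on every sample the cost is 13 bits per
sample; shared over a block it falls to a little over 10.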


It is because level changes are communicated only at one
millisecond intervals that NICAM is described as a Near
Instantaneous Companded system.
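
The companding process described above can be sketched in
Python as follows. This is a simplified model, not the NICAM
coder itself: it assumes twos complement samples, uses the
number of discarded bits (0 to 4) directly as the range code,
and leaves out parity, scrambling and framing. The function
names are invented for the example.

```python
BLOCK_SAMPLES = 32
INPUT_BITS = 14
OUTPUT_BITS = 10


def fits(value, bits):
    """True if value can be held as a signed number of this many bits."""
    return -(1 << (bits - 1)) <= value < (1 << (bits - 1))


def compand_block(samples):
    """Reduce a block of 14-bit samples to (range_code, list of 10-bit samples)."""
    if any(not fits(s, INPUT_BITS) for s in samples):
        raise ValueError("samples must be 14-bit signed values")
    # The loudest sample in the block decides how many low bits are dropped.
    shift = 0
    while not all(fits(s, OUTPUT_BITS + shift) for s in samples):
        shift += 1
    return shift, [s >> shift for s in samples]


def expand_block(range_code, coded):
    """Restore the original scale; the dropped bits come back as zeros."""
    return [c << range_code for c in coded]


quiet = [(i % 7) - 3 for i in range(BLOCK_SAMPLES)]   # low level block
loud = quiet[:-1] + [8000]                            # one high level sample

for name, block in (("quiet", quiet), ("loud", loud)):
    code, coded = compand_block(block)
    restored = expand_block(code, coded)
    worst = max(abs(a - b) for a, b in zip(block, restored))
    print(name, "range code", code, "worst error", worst)
```

A quiet block passes through unchanged, while a single loud
sample forces the whole block into the coarsest range, which is
why the range code is taken from the highest level in the block.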


NICAM FOR RADIO AND TELEVISION


The original NICAM system was developed and installed for the
BBC Radio service where many high quality distribution circuits
are required. This radio distribution network is based on a
stereo pair of digital channels with a bit rate of 676k
bits/sec and so is known as NICAM 676.


The NICAM system used for the transmission of stereo sound with
television was designed to fit in with the proposed satellite
broadcasting formats and requires a better parity system to
operate in the difficult conditions of the domestic environment.
The bit rate of this system is 728k bits/sec and it is known as
NICAM 728. Both NICAM systems have the same audio sampling
rate and reduce 14-bit coded samples to 10-bit samples in five
ranges. There are however many significant differences between
the two systems, for example:


1) NICAM 728 samples the audio for both left and right channels
   simultaneously. NICAM 676 samples the audio channels
   sequentially, first the left, then the right and so on.

2) NICAM 728 adds one parity bit to each audio sample to check
   the six most significant bits for the presence of errors.
   NICAM 676 adds one parity bit to the five most significant
   bits of three samples.

3) The range codes of NICAM 728 are sent encoded into the parity
   bits of the audio samples. NICAM 676 sends range codes as a
   number of separate bits scattered around the audio samples.


Many other technical differences exist which make it
impossible to connect the two NICAM systems directly without a
great deal of digital processing. If it is ever necessary to
link the two systems the simplest solution is to connect
analogue outputs to analogue inputs.


NICAM DELAY 


Because of the signal processing involved there is a short delay
between an audio signal entering a NICAM system and being
decoded back into audio. This delay can cause operational
problems.


The delay produced by NICAM is mainly due to the way the range
codes are protected against errors. A range code is evaluated
from the highest amplitude sample in a block of 32 samples. It
is therefore necessary to store 32 samples before the range code
is calculated, producing an initial delay of one millisecond. If
a range code is incorrectly decoded the resulting level change
introduced into the audio signal will be disastrous.


With NICAM 676 the range code bits are sent in groups of three
together with error protection bits. This causes an even
greater delay and the same time is required to check for errors
when the NICAM signal is received. The total delay from a piece
of programme entering a NICAM 676 coder to an audio signal being
received at the end of a link is 13 milliseconds. This delay is
short but some announcers find it very difficult to talk if the
cue programme in their headphones has even a short delay. This
has caused problems, for example at some local radio stations
where the announcer listens to the station output via an off air
receiver. It is therefore important, if a NICAM link is used in
some part of the broadcast chain, to be aware of possible
problems and be prepared to use a clean feed to an announcer's
headphones.


If more than one link is used, as in the case of an outside
broadcast to a studio centre and from the studio centre to the
transmitter chain, NICAM delays will accumulate. For example at
the studio centre the digital signal will have to be received
and mixed with other programme sources and then re-encoded to
form another NICAM signal which is then sent on to the
transmitter. This will produce a total delay of 26 milliseconds
from outside broadcast to transmitter. It is possible to keep
the outside broadcast programme in a digital form throughout its
passage to the transmitter but it will be necessary to
"unscramble" the range codes before mixing can take place. This
process will add the normal NICAM delay.


There is also a delay associated with NICAM 728 when used for
stereo sound with television but as the range codes are encoded
differently within the digital audio signal to that of NICAM 676
the total delay is greatly reduced to between 4 and 6 ms.


NICAM MULTIPLEXING STRUCTURE 


NICAM is designed with flexibility in mind and is based on the
bit rates of the international standard digital bearer network.
The basic unit of this network is a system that carries 30
telephone channels with a bit rate of 2048k bits per second.


NICAM uses this same bit rate to send six broadcast quality
audio channels.


A single NICAM channel samples the incoming audio at 32kHz and
each sample is coded, with digital compression, to 10 bit
accuracy. With range codes, parity bits and synchronisation bits
the total bit rate for a single channel is 338k bits per second.


The basic unit of the multiplexed NICAM hierarchy is a 676k bits
per second coder pair. These two channels can be used for
stereo working or as two separate mono channels. Six channels
can be combined as three pairs to form a six channel block.
These six channels plus more synchronisation bits produce the
2048k bit per second signal required to fit into the digital
bearer network.


It is possible to build a larger digital network by adding
together six channel NICAM blocks and in fact a 24 channel
system is planned. This will require a bit rate of 8448k bits
per second.
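
The bit rates quoted for the NICAM hierarchy can be tied
together with a little arithmetic, shown here as a Python
sketch. The rates themselves are taken from the text; the
"overhead" figures are simply the differences between them and
are not broken down further, since the text does not say how the
extra bits are allocated.

```python
SAMPLING_RATE_HZ = 32_000
BITS_PER_SAMPLE = 10
CHANNEL_KBITS = 338              # one NICAM channel
SIX_CHANNEL_KBITS = 2048         # one six channel block
TWENTY_FOUR_CHANNEL_KBITS = 8448 # planned 24 channel system


def nicam_hierarchy():
    """Return the audio payload and overhead at each level, in kbit/s."""
    audio = SAMPLING_RATE_HZ * BITS_PER_SAMPLE // 1000
    return {
        "audio bits per channel": audio,
        "overhead per channel": CHANNEL_KBITS - audio,
        "coder pair": 2 * CHANNEL_KBITS,
        "six channels": 6 * CHANNEL_KBITS,
        "six channel overhead": SIX_CHANNEL_KBITS - 6 * CHANNEL_KBITS,
        "four blocks": 4 * SIX_CHANNEL_KBITS,
        "24 channel overhead": TWENTY_FOUR_CHANNEL_KBITS - 4 * SIX_CHANNEL_KBITS,
    }


for name, kbits in nicam_hierarchy().items():
    print(f"{name:24} {kbits:5}k bits/sec")
```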


[Figure: the NICAM multiplex hierarchy, showing 1 channel,
2 channels, 6 channels and up to 24 channels built from
6 channel blocks]


NICAM therefore offers a large number of compatible options when
designing a digital audio network for broadcasting.


STEREO SOUND FOR TELEVISION


Transmission of stereo sound is not a new idea and has been
carried out using analogue techniques for radio since the 1960s.
Several countries have developed stereo sound for television by
adding an extra FM sound carrier just above the normal sound
carrier. When stereo sound is transmitted, one sound carrier
carries the left hand information and the other the right.


[Figure: the vision signal with its vision carrier, the primary
sound channel and the secondary sound channel]

Such a system is able to operate in a stereo sound mode or in a
two channel sound mode, providing facilities such as bilingual
sound coverage. The disadvantage is that the viewer with an
unmodified mono receiver will only hear one channel of a stereo
broadcast and not a mono signal. The presence of the second
sound carrier increases the chances of interference with the
vision signal of adjacent television channels.


The system of transmitting Stereo Sound for Television that has
been chosen by the BBC is a digital one that offers two high
quality sound channels with none of the problems associated
with analogue methods. This is the digital system known as
NICAM 728.


The digital NICAM signal is transmitted on a separate carrier at
a higher frequency than the existing mono analogue sound. The
viewer with a mono only receiver still receives the analogue
sound channel which continues to carry mono sound. The viewer
with a stereo set receives higher quality sound from the digital
sound carrier.

Interference from the digital signal to adjacent picture signals
is particularly noticeable when the interference is directly
related to the level of sound. The digital signal transmitted
is therefore scrambled so as to give the signal a random nature,
and it is only likely to produce an increase in background
picture noise in extreme cases of interference.




With NICAM 728 several alternative options are available:


1) A Stereo Sound Service
or
2) A Two Channel Sound Service
or
3) A Single Sound Channel + A Data Channel
or
4) A Single Data Channel


The use to which the Data channel could be put has not as yet
been decided but the system is flexible enough to allow for
future expansion. At present it is only planned to use the
Stereo Sound service.


STEREO SOUND IN SYNCS 


The Sound in Syncs system described earlier samples a mono
audio signal at twice the video line rate and inserts two 10-bit
audio samples into each video line sync pulse. The audio
sampling rate and the number of bits used for coding are
restricted by the bandwidth of the video lines that are used.
If a larger number of bits were used for coding, or the sampling
rate increased, the resulting digital signal would become
distorted by the video lines. This could result in the audio
system becoming unreliable or, at worst, not working at all. It
could be said that the sync pulse is 'full up' with digits when
mono SIS is used.


The advent of stereo sound for use with television produces the
problem of how the audio signal is to be distributed to the
transmitter sites. A Sound in Syncs system is very attractive
as the audio quality is good and there is no extra cost in lines
for distributing the sound signal. The problem is that there is
no available space to squeeze in the extra 20 bits needed for a
second audio channel.


Reducing the sampling rate so that each audio channel is sampled
only once during each video line would reduce the maximum audio
frequency of the system to about 7.5kHz. This is far from
satisfactory for a quality service. Reducing the number of bits
used to code each audio sample would make the noise performance
comparable to that of a telephone line.


The solution arrived at was to abandon the idea of a simple,
ON/OFF, binary system for the signal that is inserted into the
sync pulse and replace it with a system using four discrete
voltage levels to represent information. Twice as much
information as with a binary system can then be carried in the
same bandwidth.


Each voltage level represents the value of two binary digits.
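
The idea of carrying two binary digits on each of four levels
can be sketched in Python. The mapping used here (the pair 00
becomes level 0, 01 level 1, 10 level 2 and 11 level 3) is an
assumption made for illustration; the text does not give the
actual assignment of bit pairs to voltages.

```python
def bits_to_levels(bits):
    """Pair up binary digits and map each pair to one of four levels (0 to 3)."""
    if len(bits) % 2:
        raise ValueError("need an even number of bits")
    return [2 * bits[i] + bits[i + 1] for i in range(0, len(bits), 2)]


def levels_to_bits(levels):
    """Recover the original binary digits from a list of four-level symbols."""
    bits = []
    for level in levels:
        bits.extend(divmod(level, 2))   # (first digit, second digit)
    return bits


# 40 bits: two 10-bit samples for each of two audio channels.
stereo_bits = [1, 0, 0, 1, 1, 1, 0, 0, 1, 0] * 4
symbols = bits_to_levels(stereo_bits)
print(len(stereo_bits), "bits carried by", len(symbols), "symbols")
print(levels_to_bits(symbols) == stereo_bits)
```

The 40 bits needed for stereo occupy the same 20 symbol periods
that mono SIS used for 20 binary digits.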


The disadvantage is that it is now much easier for a noise spike
to change the value of the received digital signal. Mono SIS
did not require any parity error protection. This is not the
case, though, for a four level system which has a much lower
noise immunity than a binary system.


NICAM is used because it provides a quality audio distribution
system with 10 bits per audio sample, has a reasonable error
protection system and removes the need for analogue noise
reduction across the SIS system.


FUTURE DEVELOPMENTS


The bit rate of a digital audio system sampled at 48 kHz and
quantised to 16 bit accuracy is 768k bits/sec for each channel.
It is possible to more than halve this rate by using NICAM for
communication links. NICAM makes several compromises to achieve
this bit rate reduction such as using a lower sampling rate and
a digital companding system.
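
A quick check of the arithmetic, as a Python sketch: 48 kHz
sampling at 16 bits per sample gives 768k bits/sec for one
channel before any framing or error protection is added, and the
338k bits/sec of a single NICAM channel (quoted earlier in the
chapter) is indeed less than half of that.

```python
def pcm_kbits(sampling_rate_hz, bits_per_sample):
    """Raw bit rate of one uncompressed audio channel, in kbit/s."""
    return sampling_rate_hz * bits_per_sample / 1000


NICAM_CHANNEL_KBITS = 338   # single NICAM channel, including overheads

linear = pcm_kbits(48_000, 16)
print(linear)                                  # raw 48 kHz, 16 bit rate
print(linear / 2)                              # half of that rate
print(NICAM_CHANNEL_KBITS < linear / 2)        # NICAM is below half
```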


Research is being carried out by several manufacturers to
achieve even greater reductions in bit rate in digital audio
systems than that of NICAM. Both Dolby and SSL produce
communications equipment which condenses a 15kHz quality audio
channel down to only 128k bits/sec. This is equivalent to the
bit rate of only two telephone channels.


In each of the available systems the digital processing is
complex and relies on splitting the audio range up into
frequency bands and only coding that information which is
absolutely necessary. The effectiveness of these systems has
still to be decided, as well as the extent to which the
techniques used might affect the audio quality, if at all.


Development of these techniques is continuing and it is claimed
that it should be possible to reduce the bit rates of these
systems to only 64k bits/sec. This is the bit rate of a single
telephone channel and even if the quality were only satisfactory
such equipment could have many uses within broadcasting, such as
links from sports OBs or news reporting from remote locations.


Digital audio systems have been in use within the BBC since the
late 1960s for the distribution of network sound to the
transmitters. Within the television service this was carried
out using a system known as Sound in Syncs that multiplexed a
single digital audio channel within a vision signal. Another
system was developed for radio that combined 13 digital audio
channels so as to provide a method of distributing stereo radio
throughout the country.


Newer digital distribution systems for radio and television
sound have been developed based on a technique known as NICAM.
This reduces the bit rate requirements of a quality audio
channel by sampling the audio at a reduced rate and by
compressing the quantising from 14 bits down to 10 bits.


The Radio distribution network is based on a NICAM system with a
bit rate of 676k bits/sec for a stereo pair of channels. NICAM
pairs of channels can be added together to form a 6 channel
block that fits into the international standard telephone bit
rate of 2048k bits/sec. It is planned to build up this system
to form a 24 channel digital distribution network.


The transmission of stereo sound with television is based on a
NICAM system with increased parity and error protection to help
the system operate in the often difficult conditions of the
domestic environment. This NICAM system has a bit rate of 728k
bits/sec for a stereo pair of channels.


Appendix 1 
Binary Numbers 


The familiar, decimal, counting system uses the ten different 
digits 0 to 9. To enable counting up to any number, digits are 
arranged in rows with the position of the digit indicating a 
weighting factor which is some power of 10: 


1000s 100s 10s units 


Binary counting works in exactly the same way, except that it 
uses only two different digits, 0 and 1. 
The weighting factor 
indicated by the position of the digit is now a power of 2: 


32s 16s 8s 4s 2s units


So counting in binary goes like this:


Decimal   Binary


It might seem that impossibly long strings of digits are needed
to count useful numbers. However, 16 BInary digiTS or BITS can
count up to the decimal number 65,535 and 32 bits can count to
over 4 billion - using roughly three times as many digits as the
decimal equivalent in each case.
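
Binary counting is easy to try out in Python, as this short
sketch shows. The helper names are chosen for the example. It
prints the first few counts side by side and then confirms the
largest numbers that 16 and 32 bits can reach.

```python
def to_binary(number, width=4):
    """Write a non-negative whole number as a fixed-width string of 0s and 1s."""
    return format(number, f"0{width}b")


def largest_count(bits):
    """Largest number that can be counted with this many binary digits."""
    return 2 ** bits - 1


print("Decimal  Binary")
for number in range(9):
    print(f"{number:>7}  {to_binary(number)}")

for bits in (16, 32):
    top = largest_count(bits)
    print(f"{bits} bits count up to {top:,} ({len(str(top))} decimal digits)")
```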


The value of binary numbers is simply that the two different
digits can easily be represented by the ON or OFF states of a
transistor switch in the electronic logic circuits that are the
basis of all digital systems, whether they are computers, alarm
clocks or Compact Disc players.


Appendix 2 
Coding Systems 


An audio signal has both positive and negative values. 
These 
can be expressed in a number of different ways in digital form. 
The three important methods are offset binary, twos complement 
and sign and magnitude. 


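They are compared below:


            offset      twos          sign &
            binary      complement    magnitude

+ peak      1111        0111          0111
            1110        0110          0110
            1101        0101          0101
            1100        0100          0100
            1011        0011          0011
            1010        0010          0010
+1          1001        0001          0001

zero        1000        0000          0000

-1          0111        1111          1001
            0110        1110          1010
            0101        1101          1011
            0100        1100          1100
            0011        1011          1101
            0010        1010          1110
            0001        1001          1111
- peak      0000        1000          -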

Note that all three methods make use of the leftmost or 'Most
Significant' bit to indicate the sign of the number - it is
therefore known as the 'Sign bit'.


Offset binary simply counts up from the negative peak through
'zero' to the positive peak.


Offset binary can be converted to twos complement by inverting
the sign bit.


The most widely used system is twos complement. Twos complement
numbers possess the useful property that any positive number
added to its complementary negative number results in zero (if
the carry is ignored). This is desirable if the digital audio
signal is going to undergo any signal processing such as mixing.


For example, +3 + (-3):      0011
                           + 1101
                           ------
                          (1)0000


The allowable negative range of the signal is one quantising
interval greater than the positive range when using offset
binary and twos complement. This is of no concern in a
practical system.
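
The three coding methods can be generated and checked with a
short Python sketch using 4-bit words. The function names are
invented for the example, and the sign and magnitude version
assumes that a sign bit of 1 means negative, which is the usual
convention but is not stated in the text.

```python
BITS = 4


def offset_binary(value, bits=BITS):
    """Count up from the negative peak: the most negative value is all 0s."""
    return format(value + (1 << (bits - 1)), f"0{bits}b")


def twos_complement(value, bits=BITS):
    """Keep the lowest `bits` bits of the value, so -1 becomes all 1s."""
    return format(value & ((1 << bits) - 1), f"0{bits}b")


def sign_magnitude(value, bits=BITS):
    """Sign bit (assumed 1 = negative) followed by the size of the value.

    Returns None for the extra negative value, which has no code here.
    """
    if abs(value) >= (1 << (bits - 1)):
        return None
    sign = "1" if value < 0 else "0"
    return sign + format(abs(value), f"0{bits - 1}b")


print("value  offset  twos  sign&mag")
for value in range(7, -9, -1):
    print(f"{value:>5}  {offset_binary(value)}    {twos_complement(value)}  "
          f"{sign_magnitude(value) or '----'}")

# Adding +3 and -3 in twos complement gives zero once the carry is ignored.
total = int(twos_complement(3), 2) + int(twos_complement(-3), 2)
print(format(total & 0b1111, "04b"))
```

The printed table also shows the extra negative value (-8 in a
4-bit system) that offset binary and twos complement can hold
but sign and magnitude cannot.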


